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A METHOD FOR DETECTION OF COLORECTAL CANCER IN HUMAN SAMPLES 



Field of invention 



The present invention relates to a method of diagnosing colorectal cancer in human 
samples using several novel protein markers. Differential expression pattern of these 
5 markers are indicative of a person having colorectal cancer and/or predictive of the stage 
of the disease in a colorectal cancer patient. 



Background 

Colorectal cancer is one of the world's most common cancers and the second leading cause 
10 of death due to cancer in the western world. Investigations of colorectal cancer show that 
most colorectal cancers develop from adenomatous polyps. The polyps are usually small 
and pre-neoplastic growths that develop on the lining of the colon and can over time 
progress into colorectal cancer. Colorectal cancer occurs as a result of a sequence of 
mutations during a long period of time and these mutations mark the several different 
15 pathological stages of the disease. A model put forward by Fearon and Vogelstein 

describes colorectal cancer progression from normal epithelia to metastasis through the 
phases of dysplasia, early, intermediate and late adenoma and carcinoma. 

A rare, inherited condition called familial polyposis (FAP) causes hundreds of polyps to 
20 form in the colon and rectum and unless this condition is treated, FAP is almost certain to 
lead to colorectal cancer. These individuals are therefore in a special need for an accurate 
screening test, where biopsies can be taken from a polyp during colonoscopy and analysed 
for neoplastic changes. 

25 Several mutations in oncogenes and tumour-suppresser genes have been identified in 
colorectal cancers and some of them have been associated with the phases of the disease 
mentioned above. 

The risk factors for developing colorectal cancer seem to be age, diet, colon polyps, 
30 personal medical history, family medical history and inflammatory bowel disease 
(Ulcerative colitis and Crohn 's disease). 

Colorectal cancer incidences and mortality rates increase with age and sharply so after the 
age of 60. It is estimated that more than one-third of colorectal cancer deaths could be 
35 avoided if people over the age of 50 had regular screening tests, since over 90% of all 
cases occur in people 50 and older. This is due to the fact that colorectal cancer is one of 
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the most preventable cancers, if it is detected at its early stages. If screening tests were 
performed on the risk groups for colorectal cancer, it could help to prevent deaths due to 
the disease by finding pre-cancerous polyps so they can be removed before they turn into 



cancer. 

5 



Studies have shown that women with a history of cancer of the ovary, uterus, or breast 
have a somewhat increased chance of developing colorectal cancer. The risk of developing 
colorectal cancer the second time seems to be evident as well. So these findings suggest 
that personal medical history seems to be relevant in terms of the assessment of risk for 
10 colorectal cancer. The same seems to be true for family medical history. First-degree 
relatives (parents, siblings, children) of a person who has had colorectal cancer are 
somewhat more likely to develop this type of cancer themselves. Ulcerative colitis is a 
chronic condition where the lining of the colon becomes inflamed and persons having this 
condition are considered at a greater risk of developing colorectal cancer than others 

15 

The usual diagnostic methods for colorectal cancer are procedures such as sigmoidoscopy 
and colonoscopy, that involve looking inside the rectum and the lower colon 
(sigmoidoscopy) or the entire colon (colonoscopy) and allowing for removal of polyps or 
other abnormal tissue for examination under a microscope. A polypectomy is the removal 

20 of polyp(s) during a sigmoidoscopy or colonoscopy, which is a procedure often performed 
on individuals suffering from FAP and individuals with sporadic, recurrent colorectal polyps 
Another way is to do X-rays of the large intestine, which is a technique that can reveal 
polyps or other changes in the intestine. A much less cumbersome method, but less 
indicative, is the faecal occult blood test (FOBT). It is a test used to check for hidden blood 

25 in the stool, as it has been observed that cancers or polyps can bleed, and FOBT is able to 
detect small amounts of bleeding in the stool. 

The potential use of mass spectrometry as an aid for diagnosing cancer has been 
demonstrated in WO 01/25791 A2, disclosing protein markers from prostate cancer 
30 patients being differently expressed as compared to samples from healthy subjects or 
patients with benign prostate hyperplasia (BPH). 

Several studies describe usefpl markers for the diagnosis of colorectal cancer. US 
6,455,668 describes a screening method for identifying bioactive agents being capable of 
35 binding to a colorectal cancer modulating protein (BCMP). It further describes a method for 
screening drug candidates, wherein a gene expression profile is used including CJA8 or 
fragments thereof. The expression profile can further include markers selected from the 
group consisting of CZA8, BCX2, CBC2, CBC1, CBC3, CJA9, CGA7, BCN5, CQA1, BCN7 
CQA2, CGA8, CAA7 and CAA9 (WO 00/55633). Another publication, US 2001/0044113 
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describes the use of PKC isozyme, in combination with more conventional cancer markers 
such as bcl-2, bax and c-myc, to detect changes in colonocyte gene expression associated 
with early stages of colon tumorlgenesis by isolation of poly A+ RIMA from faeces. It should 
also be mentioned that the use of an undefined Defensin-polypeptide (Defensin-X) in 
5 diagnosing cancer is described in WO 99/11663. 

There is, however, still unmet need for a simple diagnostic and/or prognostic test to 
provide an indication of whether or not an individual has colorectal cancer. It would also be 
of tremendous help to have a test giving indication of the status during surveillance of the 
10 disease. 



Summary of the Invention 

The present invention relates to a method of diagnosing colorectal cancer in a sample 
using novel protein markers. The markers have been identified by assaying a number of 
15 tissue and serum samples from healthy individuals and persons diagnosed with colorectal 
cancer by means of protein chip technology using mass spectrometry. 

Differential expressions patterns of these markers are indicative of a person having 
colorectal cancer and/or predictive of the stage of the disease in a colorectal cancer 
20 patient. The diagnosis is based on comparing at least one intensity value, obtained using 
the method, to a reference value. 



Detailed description of the invention 



It is an object of preferred embodiments of the present invention to provide a method for 
25 diagnosing colorectal cancer in a sample from a mammal, the method comprising obtaining 
a sample from said mammal and assaying said sample by a quantitative detection assay, 
and determining the intensity signal of at least one marker. 

In this text the words protein, peptide, polypeptide are used interchangeably, and all 
30 describe a chain of amino acids. In some cases the chain of amino acids have so called 
post translational modifications or bind certain ligands (for example ions). In some cases 
the chain of amino acid is a full-length (native) protein, in some cases it is a smaller 
fragment of a full-length protein. The mass values correspond solely to the measured 
mass. 

35 
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The present invention relates to a number of markers. The at least one marker, such as 
two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, 
fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty-one, twenty-two, twenty- 
three, twenty-four, twenty-five, twenty-six, twenty-seven, twenty-eight, twenty-nine, 
5 thirty, thirty-one, thirty-two, thirty-three, thirty-four, thirty-five, thirty-six, thirty-seven, 
thirty-eight, thirty-nine, forty, forty-one, forty-two, forty-three, forty-four, forty-five, forty- 
six, forty-seven, forty-eight, forty-nine, fifty, fifty-one, fifty-two, fifty-three, fifty-four 
fifty-five, fifty-six, fifty-seven, fifty-eight, fifty-nine, sixty, sixty-one, sixty-two, sixty-three 
sixty-four, sixty-five, sixty-six, sixty-seven, sixty-eight, sixty-nine, seventy, seventy-one 
10 seventy-two, seventy-three, seventy-four, seventy-five, seventy-six, seventy-seven, 
seventy-eight, seventy-nine, eighty, eighty-one, eighty-two, eighty-three, eighty-four 
eighty-five, eighty-six, eighty-seven, eighty-eight, eighty-nine, ninety, ninety-one, ninety- 
two, ninety-three, ninety-four, ninety-five, ninety-six, ninety-seven, ninety-eight, ninety- 
nine, hundred, hundred and one, hundred and two, hundred and three, hundred and four 
15 hundred and five, hundred and six, hundred and seven, hundred and eight, hundred and ' 
nine, hundred and ten, hundred and eleven, hundred and twelve, hundred and thirteen 
hundred and fourteen, hundred and fifteen, hundred and sixteen, hundred and seventeen 
hundred and eighteen, hundred and nineteen, hundred and twenty, hundred and twenty- ' 
one, hundred and twenty-two, hundred and twenty-three, hundred and twenty-four 
20 hundred and twenty-five, hundred and twenty-six, hundred and twenty-seven, hundred 
and twenty-eight, hundred and twenty-nine, hundred and thirty, hundred and thirty-one 
hundred and thirty-two, hundred and thirty-three, hundred and thirty-four, hundred and' 
thirty-five, hundred and thirty-six, hundred and thirty-seven, hundred and thirty-eight 
hundred and thirty-nine, hundred and forty, hundred and forty-one, hundred and forty- 
25 two, hundred and forty-three, hundred and forty-four, f hundred and orty-five, hundred 
and forty-six, hundred and forty-seven, hundred and forty-eight, hundred and forty-nine 
and hundred and fifty markers, can be selected from the group consisting of the 
polypeptides having apparent molecular weight of 66800 Da, 66500 Da, 66300 Da, 64860 
Da, 60730 Da, 60500 Da, 60475 Da, 46000 Da, 45500 Da, 44300 Da, 33000 Da, 28040 
30 Da, 28025 Da, 28010 Da, 28000 Da, 27700 Da, 19966 Da, 19900 Da, 19865 Da, 16150 
Da, 15935 Da, 15580 Da, 15200 Da, 15140 Da, 14470 Da, 14300 Da, 14100 Da, 14030 
Da, 13870 Da, 13747 Da, 11723 Da, 13700 Da, 13331 Da, 13265 Da, 12000 Da 11989 
Da, 11987 Da, 11900 Da, 11700 Da, 11650 Da, 11550 Da, 11500 Da, 11133 Da, 11080 
Da, 10830 Da, 9950 Da, 9700 Da, 9600 Da, 9197 Da, 9140 Da, 9090 Da, 9079 Da 8971 
35 Da, 8940 Da, 8931 Da, 8930 Da, 8652 Da, 8580 Da, 8230 Da, 7469 Da, 7324 Da 7023 
Da, 6880 Da, 6850 Da, 6660 Da, 6650 Da, 6635 Da, 6450 Da, 6436 Da, 6435 Da,' 6430 
Da, 6125 Da, 6110 Da, 6090 Da, 5920 Da, 5906 Da, 5905 Da, 5900 Da, 5871 Da, 5857 
Da, 5540 Da, 5360 Da, 5330 Da, 5266 Da, 5260 Da, 5234 Da, 5075 Da, 4977 Da' 4749 
Da, 4660 Da, 4640 Da, 4634 Da, 4500 Da, 4480 Da, 4460 Da, 4330 Da, 4300 Da' 4290 
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Da, 4281 Da, 4270 Da, 4266 Da, 4264 Da, 4168 Da, 4136 Da, 4039 Da, 4024 Da, 4000 
Da, 3984 Da, 3980 Da, 3960 Da, 3895 Da 3882 Da, 3878 Da, 3816 Da, 3777 Da/3712 
Da, 3680 Da, 3651 Da, 3574 Da, 3570 Da (def 2), 3487 Da, 3480 Da (def 3),3450 Da (def 
1), 3444 Da, 3408 Da, 3372 Da, 3280, 3275 Da, Da, 3160, Da, 2960 Da, 2955 Da, 2933 
5 Da, 2878 Da, 2850 Da, 2840 Da, 2799 Da, 2693 Da, 2462 Da, 2450 Da, 2364 Da, 2330 
Da, 2275 Da, 2230 Da, 2210 Da, 1945 Da,1930 Da, 1688 Da, 1536 Da, 1365 Da,1256 
Da, 1042 Da, 1026 Da, and 1005 Da. 

Thereafter, the method in a preferred embodiment comprises comparing said intensity 
10 signal(s) with reference value(s) and identifying whether the intensity signal of at least one 
marker from the sample is significantly different from a reference value. 

It is an object of the present invention to provide a method of diagnosing colorectal cancer 
in a sample from a mammal. The method comprises obtaining a sample from said 
15 mammal, detecting in the sample from the mammal at least one marker by a quantitative 
detection assay and determining the intensity signal of the least one marker, wherein the 
marker is selected from the group consisting of the polypeptides having apparent 
molecular weight of: 

20 66800 Da, 66500 Da, 66300 Da, 64860 Da, 60730 Da, 60500 Da, 60475 Da, 46000 Da, 
45500 Da, 44300 Da, 33000 Da, 28040 Da, 28025 Da, 28010 Da, 28000 Da, 27700 Da, 
19966 Da, 19900 Da, 19865 Da, 16150 Da, 15935 Da, 15580 Da, 15200 Da, 15140 Da,' 
14470 Da, 14300 Da, 14100 Da, 14030 Da, 13870 Da, 13747 Da, 11723 Da, 13700 Da' 
13331 Da, 13265 Da, 12000 Da 11989 Da, 11987 Da, 11900 Da, 11700 Da, 11650 Da,' 

25 11550 Da, 11500 Da, 11133 Da, 11080 Da, 10830 Da, 9950 Da, 9700 Da, 9600 Da, 9197 
Da, 9140 Da, 9090 Da, 9079 Da, 8971 Da, 8940 Da, 8931 Da, 8930 Da, 8652 Da, 8580 
Da, 8230 Da, 7469 Da, 7324 Da, 7023 Da, 6880 Da, 6850 Da, 6660 Da, 6650 Da, 6635 
Da, 6450 Da, 6436 Da, 6435 Da, 6430 Da, 6125 Da, 6110 Da, 6090 Da, 5920 Da, 5906 
Da, 5905 Da, 5900 Da, 5871 Da, 5857 Da, 5540 Da, 5360 Da, 5330 Da, 5266 Da, 5260 

30 Da, 5234 Da, 5075 Da, 4977 Da, 4749 Da, 4660 Da, 4640 Da, 4634 Da, 4500 Da, 4480 
Da, 4460 Da, 4330 Da, 4300 Da, 4290 Da, 4281 Da, 4270 Da, 4266 Da, 4264 Da, 4168 
Da, 4136 Da, 4039 Da, 4024 Da, 4000 Da, 3984 Da, 3980 Da, 3960 Da, 3895 Da'3882 
Da, 3878 Da, 3816 Da, 3777 Da, 3712 Da, 3680 Da, 3651 Da, 3574 Da, 3570 Da (def 2), 
3487 Da, 3480 Da (def 3),3450 Da (def 1), 3444 Da, 3408 Da, 3372 Da, 3280, 3275 Da,' 

35 Da, 3160, Da, 2960 Da, 2955 Da, 2933 Da, 2878 Da, 2850 Da, 2840 Da, 2799 Da, 2693 
Da, 2462 Da, 2450 Da, 2364 Da, 2330 Da, 2275 Da, 2230 Da, 2210 Da, 1945 Da,1930 
Da, 1688 Da, 1536 Da, 1365 Da, 1256 Da, 1042 Da, 1026 Da, and 1005 Da. 
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The method further comprises comparing said intensity signal(s) with reference value(s) 
and identifying whether the intensity signal of at least one marker from the sample is 
significantly different from the reference value for said marker. 

5 In one aspect of the present invention a method is provided for diagnosing colorectal 
cancer by means of a serum sample from a mammal. The method comprises obtaining a 
serum sample from said mammal, detecting in the serum sample from the mammal at 
least one marker by a quantitative detection assay and determining the intensity signal of 
the at least one marker, wherein the marker is selected from the group consisting of the 
10 polypeptides having apparent molecular weight of: 

66500 Da, 60500 Da, 46000 Da, 45500 Da, 44300 Da, 28040 Da, 27700 Da, 33000 Da 
19900 Da, 16150 Da, 15935 Da, 15580 Da, 15200 Da, 15200 Da, 13700 Da, 11900 Da, 
11700 Da, 11650 Da, 11550 Da, 11500 Da, 11080 Da, 10830 Da, 9140 Da, 8940 Da, 
15 8930 Da, 8230 Da, 6880 Da, 6650 Da, 6660 Da, 6450 Da, 6430 Da, 6125 Da, 6110 Da 
6090 Da, 5920 Da, 5900 Da, 5540 Da, 5330 Da, 5260 Da, 4660 Da, 4640 Da, 4460 Da' 
4330 Da, 4300 Da, 4290 Da, 4000 Da, 3980 Da , 3960 Da, 3680 Da, 3280 Da, 3275 Da, 
Da, 3160 Da, 2955 Da, 2450 Da, and 1536 Da. 

20 The method further comprises comparing said intensity signal(s) with reference value(s) 
and identifying whether the intensity signal of at least one marker from the sample is 
significantly different from the reference value for said marker. 

In another aspect of the present invention a method is provided for diagnosing colorectal 
25 cancer in a tissue sample from a mammal. The method comprises obtaining a tissue 

sample from said mammal, detecting in the tissue sample from the mammal at least one 
marker by a quantitative detection assay and determining the intensity signal of the at 
least one marker, wherein the marker is selected from the group consisting of the 
polypeptides having apparent molecular weight of: 

30 

15140 Da, 11989 Da, 11987 Da, 9700 Da, 9600 Da, 9197 Da, 9079 Da, 8971 Da, 8652 
Da, 8580 Da, 7324 Da, 7023 Da, 5871 Da, 5857 Da, 5360 Da, 5234 Da, 5075 Da, 4749 
Da, 4634 Da, 4281 Da, 4266 Da, 4168 Da, 4039 Da, 4024 Da, 3984 Da, 3878 Da, 3777 
Da, 3712 Da, 3651 Da, 3574 Da, 3487 Da, 3444 Da, 3408 Da, 3372 Da, 2933 Da 2878 
35 Da, 2840 Da, 2799 Da, 2693 Da, 2462 Da, 2364 Da, 2330 Da, 1930 Da, 1688 Da, 1365 
Da, 1256 Da, 1042 Da, 1026 Da, and 1005 Da, 
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the method further comprises comparing said intensity signal(s) with reference value(s) 
for said marker(s) and identifying whether the intensity signal of at least one marker from 
the sample is significantly different from the reference value. 



5 In yet another aspect of the present invention a method is provided for diagnosing 
colorectal cancer by means of a plasma sample from a mammal. The method comprises 
obtaining a plasma sample from said mammal, detecting in the plasma sample from the 
mammal at least one marker by a quantitative detection assay and determining the 
intensity signal of the at least one marker, wherein the marker is selected from the grou 
10 consisting of the polypeptides having apparent molecular weight of: 

66800 Da, 66500 Da, 66300 Da, 64860 Da, 60730 Da, 60475 Da, 19966 Da, 19865 Da, 
14470 Da, 14300 Da, 14100 Da, 14030 Da, 13870 Da, 13747 Da, 11723 Da, 9950 Da, 
8931 Da, 7469 Da, 6635 Da, 6435 Da, 5905 Da, 5266 Da, 4977 Da, 4480 Da, 4136 Da, 
15 and 3895 Da, 



the method further comprises comparing said intensity signal(s) with reference value(s) 
for said markers and identifying whether the intensity signal of at least one marker from 
the sample is significantly different from the reference value for said marker. 

Another embodiment of the present invention provides a use of at least one marker 

selected from the group consisting of the polypeptides having apparent molecular weight 
of 



25 66800 Da, 66500 Da, 66300 Da, 64860 Da, 60730 Da, 60500 Da, 60475 Da, 46000 Da, 
45500 Da, 44300 Da, 33000 Da, 28040 Da, 28025 Da, 28010 Da, 28000 Da, 27700 Da, 
19966 Da, 19900 Da, 19865 Da, 16150 Da, 15935 Da, 15580 Da, 15200 Da, 15140 Da,' 
14470 Da, 14300 Da, 14100 Da, 14030 Da, 13870 Da, 13747 Da, 11723 Da, 13700 Da, 
13331 Da, 13265 Da, 12000 Da 11989 Da, 11987 Da, 11900 Da, 11700 Da, 11650 Da, 

30 11550 Da, 11500 Da, 11133 Da, 11080 Da, 10830 Da, 9950 Da, 9700 Da, 9600 Da, 9197 
Da, 9140 Da, 9090 Da, 9079 Da, 8971 Da, 8940 Da, 8931 Da, 8930 Da, 8652 Da, 8580 
Da, 8230 Da, 7469 Da, 7324 Da, 7023 Da, 6880 Da, 6850 Da, 6660 Da, 6650 Da, 6635 
Da, 6450 Da, 6436 Da, 6435 Da, 6430 Da, 6125 Da, 6110 Da, 6090 Da, 5920 Da, 5906 
Da, 5905 Da, 5900 Da, 5871 Da, 5857 Da, 5540 Da, 5360 Da, 5330 Da, 5266 Da, 5260 

35 Da, 5234 Da, 5075 Da, 4977 Da, 4749 Da, 4660 Da, 4640 Da, 4634 Da, 4500 Da, 4480 
Da, 4460 Da, 4330 Da, 4300 Da, 4290 Da, 4281 Da, 4270 Da, 4266 Da, 4264 Da, 4168 
Da, 4136 Da, 4039 Da, 4024 Da, 4000 Da, 3984 Da, 3980 Da, 3960 Da, 3895 Da 3882 
Da, 3878 Da, 3816 Da, 3777 Da, 3712 Da, 3680 Da, 3651 Da, 3574 Da, 3570 Da (def 2), 
3487 Da, 3480 Da (def 3),3450 Da (def 1),3444 Da, 3408 Da, 3372 Da, 3280, 3275 Da, ' 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/090550 PCT/DK2004/000263 

8 

Da, 3160, Da, 2960 Da, 2955 Da, 2933 Da, 2878 Da, 2850 Da, 2840 Da, 2799 Da 2693 
Da, 2462 Da, 2450 Da, 2364 Da, 2330 Da, 2275 Da, 2230 Da, 2210 Da, 1945 Da,l930 
Da, 1688 Da, 1536 Da, 1365 Da, 1256 Da, 1042 Da, 1026 Da, and 1005 Da, 

5 for the prediction of the clinical outcome, complications and mortality of an individual 
diagnosed with colorectal cancer. 

In the present context, the term "diagnosing" includes determining whether a person has 
colorectal cancer as well as indicating the stage or prognosis of a cancer in a patient. 



10 



As will be evident to a person of skill in the art, it is not always possible to diagnose with 
certainity whether a person has colorectal cancer by use of a method of the invention 
Within the broad term "diagnosing" is thus also included determining a diagnosis by use of 
at least one of the markers disclosed herein with a certain specificity i.e. 50% or 60% and 
15 preferably with a higher specificity, such as 70%, 75%, 80%, 83%, 84%, 85%, 86% 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98<>/o, 99%, or most 
preferably 100%. 

The sensitivity of the method of diagnosing is also of importance. The sensitivity that the 
20 diagnosis provided by use of at least one of the markers disclosed herein is correct should 
be 50% or 60%, preferably higher such as 62%, 70%, 72%, 74%, 77%, 80%, 83% 84% 
85o/o, 86%, 870/0, 88%, 89%, 90o/ 0 , 91%, 92%, 9 3 o/o, 94%, 95%, 96%, 97%, 98%, 99%,' 
or most preferably 100%. 

25 The experimental part of the application provides a number of examples of preferred 
markers and combination of markers and the combination of specificity and sensitivity 
obtained when using said markers. These markers and combinations of markers are 
presently preferred embodiments of the invention. 

30 In the context of the present invention, the term "prognosis" relates to an opinion 

(professional or non-professional, preferably a professional) on how an illness or a disease 
w.ll develop and how the illness or disease will influence on other health conditions and 
death/survival of the mammal. 

35 It is contemplated that by use of at least one of the markers of the invention or a 

combination of markers it will be possible to determine the prognosis or clinical outcome 
for an individual patient. 
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The present invention provides the means for giving a prognosis of the clinical outcome, 
complications and mortality of said mammal. In the context of the present invention, the 
term "clinical outcome" relates to the 'final result' or the 'final situation' or the condition of 
the patient after the patient has experienced a disease, e.g. a colorectal cancer or related 
5 diseases of the gastrointestinal tract. Thus, the clinical outcome may be death within a 
year or survival, and survival can be everything from poor health condition (moribund) to 
a healthy period for a period of 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 years. 

In the context of the present invention, the term "complications" relates to symptoms of 
10 anything arising after the diagnosis of the disease, e.g the cancer spreading to other 
organs or tissues (metastasis), recurrence of carcinoma within the colon or development 
of a second primary colorectal cancer. 



It should be understood that any feature and/or aspect discussed above in connection with 
15 the determination according to the invention apply by analogy to the "diagnosis", 
"prognosis" and "clinical outcome" according to the invention. 

The term "colorectal cancer" relates to diseases such as colon cancer, familial 
adenomatous polyposis (FAP), rectal cancer and inflammatory bowel disease (IBD). It also 
20 relates to the non-invasive pre-cancerous lesions such as adenomatous polyps. 

In the present context, the term "phases of colorectal cancer" relates to the progressive 
stage of the disease. This diagnosis of the severity of colorectal cancer is most often based 
on pathological observations after surgery. This currently used diagnostic model describes 
25 colorectal cancer progression from normal epithelia to metastasis through the phases of 
dysplasia, adenoma (early, intermediate and late) and carcinoma. 

In the present context, the term mammal refers to a primate, preferably a human. 

30 In order to detect the presence of a gene product in a biological sample, one can measure 
either DNA/RNA or protein or both using quantitative detection assay(s). Such detection 
assay can be selected from the group consisting of immunoassay, kinetic/real-time PCR, 
2D gel, protein array, gene array and other nano-technology methods. 

35 In the present context, the term "immunoassay" refers to assays such as EUSA (Enzyme- 
Linked Immunosorbent Assay), RIA (Radioimmunoassay) and FIA (Fluoroimmunoassay), 
which are based on the EUSA sandwich concept of catching antibody and detection 
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antibody with different specificity to the same moiecule. The detection antibody is then 
labeiled with an enzyme, fluorochrome or a radioactive substance or the like, to quantify 
the desired molecule (protein), and the sensitivity of the assay depends partially on the 
label of the detection antibody. 

5 

The term" 2D-Gel" (two-dimensional electrophoresis) relates in the present context to the 
electrophoresis technique where a protein extract is subjected to an electrophoresis in one 
dimension and then directly afterwards to a second electrophoresis in a second dimension. 
The conditions during the separate steps are different, in terms of time of separation 
10 voltage, buffer and agents present during the separation. 

In a preferred embodiment of the present invention mass spectrometry is used to detect 
the protein markers. Furthermore the mass spectrometry method used is preferab.y a 
SELDI-TOF (Surface Enhanced Laser Desorption Ioni 2 ation)-TOF (Time of Flight) 
15 technique, where the protein extract is bound to a protein chip. The chips have an active 
surface chemistry, which can be modified to retain proteins with certain properties 
Thereby, proteins with different properties can be retained by different set of conditions 
and measured by MALDI-TOF or the like. 

20 The SELDI-TOF/MS technique: 

General description: 

SELDI-TOF/MS (Surface Enhanced Laser Desorption/Ionfsation-Time Of Flight/Mass 
Spectrometry) (Ciphergen) Is mass spectrometry where the samples are purified on 
25 Proton Chips (Ciphergen) prior to analysis. In this purification step the majority of proteins 
and salts & lipids) are removed and only a relatively small number of proteins remains on 
the chip surface. This chip is then analysed by mass spectrometry. 

Protein chips and buffer solutions: 
30 Different chips are available, and by varying the buffer solutions used in the binding and 
washing steps, different protein profiles are observed when ana.ysing the chips in the PBS 
I instrument (Ciphergen). Thus, a person skilled in the art normally would test different 
chips and buffer solutions. 

35 The chips are composed of common chromatographic materials, also used in HPLC 
techniques (anion-, cation, and hydrophobic-Zreverse phase- surfaces) and the buffer 
so.ut.ons are a.so commonly used in other purification techniques. There is basically no 
deference between purification on a protein chip, as described here, and purification on a 
chromatographic column or by precipitating proteins by chromatographic pearls 
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10 



Analysis on the SELDI instrument: 

The chips are analysed on the PBS II instrument (Ciphergen), which is an MALDI-TOF/MS 
(Matrix assisted Laser Desorption/Ionisation-Time Of Flight/Mass Spectrometry) 
instrument. The PBS II has a special loading device that allows analysis of protein chips, 
but is otherwise a normal MALDI-TOF/MS instrument. Instead of using specific chips, wtth 
specific binding abilities, a gold chip (Au Chip (Ciphergen)) can be used. In this case the 
protein solution is not purified on the chip but applied directly on to the gold surface and 
left to dry up together with the crystallisation solution; this is MALDI-TOF/MS. 

Some proteins are present at very low concentrations in serum and can therefore only be 
detected after they have been in-concentrated on the protein chip (which is the initial step 
in the SELDI technique) and not directly by MALDI. 

15 The only difference between SELDI and MALDI is that in SELDI the samples are treated in 
a special way before analysis. This allows for homogenous comparison of samples, which 
allows for sensitive identification of proteins that are differentially expressed. But the way 
the proteins are detected is exactly the same. 

20 Basically, there is no difference between purifying proteins by standard purification 
techniques ((nano)-HPLC, gel-filtration, precipitation) and subsequently analysing the 
samples by MALDI, and by purifying the proteins on protein chips and analysing them by 
MALDI. The latter combination is called SELDI. 

25 MALDI-TOF/MS is a technique that is highly sensitive in measuring the mass of molecules, 
especially proteins. The PBS II instrument has an accuracy of below +/- 0.2%, and in most 
cases around +/- 0.1%. Thus the mass value of a protein with m/z: 5000 is in most case 
m/z 5000 +/- 5. Therefore the measured masses are all defined as +/- maximum 0.2% 
and +/- minimum 0.1%. 



30 



Protein chips of the invention can be chips with an immobilized metal affinity capture array 
with a nitriloacetic acid (NTA) surface. An example of such a chip is the IMAC30 
ProteinChip Array, which is activated with transition metals prior to use. 

35 Other protein chips of the invention are arrays comprising a carboxylate chemistry 

(negatively charged) acting as a weak cation exchanger. The CM 10 ProteinChip Array is an 
example of such an array. 
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Protein chips of the present invention may further be arrays, which bind proteins through 
reversed phase or hydrophobic interaction chromatography and have binding 
characteristics similar to that of a C6 to C12 alkyl chromatographic resin. The H50 
ProteinChip array is an example of such an protein chip. 

5 

The protein chips of the present invention can also be arrays being strong anion exchange 
array comprising quaternary amine functionality such as the SAX2 ProteinChip Array. 

Furthermore, the protein chips of the present invention can be mimic normal phase 
10 chromatography with silicate functionality such as the NP20 ProteinChip. 

In the present context, the term "gene microarray" relates to low density nucleotide 
arrays, where nucleotide probes are attached or synthesised onto a surface and used as 
probes to retain nucleotides, mostly mRNA. This is usually referred to as transcription 
15 profiling, i.e. detection of the mRNA transcripts currently being used in a tissue at a certain 
time. Examples of such arrays are oligonucleotide arrays, where oligonucleotides are 
printed on glass slides and cDNA arrays, where cDNA (complementary DNA) is spotted on 
glass slide. 

20 In a preferred embodiment of the present Invention, the intensity signal detected in the 
quantitative detection assays Is selected from the group consisting of fluorescence signal, 
mass spectrometry images, radioactivity, enzyme activity, and antibody detection. 

The reference value can be calculated from a pool of samples from individuals with cancer 
25 and by comparison with a pool of samples from healthy individuals, a range for positive 
and negative calls can be made. Another possibility is to set a reference value based on a 
pool of samples from various phases or stages of the cancer to determine the progression 
or a stage of the disease. It may even be desirable to set reference values for prognosis of 
the disease. The reference value can be calculated as a mean or a median value of each 
30 intensity signal value(s) calculated from data from one or many of the markers, wherein 
the negative values are made positive. The reference value could even be the area under 
the curve (AUC) of at least one of the protein markers. 

In one embodiment of the present invention the reference value is indicative of the stage 
35 of colorectal cancer. This may be accomplished by collecting a number of samples from 
several patients and after the samples have been diagnosed by the stage of the disease, 
the samples from the same stage are assayed. 
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In the present context, the reference value can be based on data calculated from intensity 
signal value(s) of said marker(s) obtained from a sample without colorectal cancer from 
the same mammal. The reference value can also comprise data calculated from intensity 
signal value(s) of said marker(s) obtained from samples from normal and colorectal cancer 
5 tissue from the same mammal. Samples can furthermore be obtained from both a healthy 
control population and a population having said cancer which samples are used to 
determine the reference value. After the reference value is determined with a statistical 
significance, such as but not limited to p-values of levels below 0.1. By assaying a 
significant number of patients and healthy individuals, the specificity of the method can be 
10 determined, obtaining a specified sensitivity. Thereby, it can be determined whether a 
person is likely to have colorectal cancer or not with a predetermined specificity and/or a 
predetermined sensitivity. 

In the present context the term "data" relates t P any calculation made using the intensity 
15 signal(s) as data input. The intensity signal(s) may be fluorescence signal, mass 

spectrometry images, radioactivity, spectrometry values, etc. The data can be obtained 
using any kind of mathematical formula or algorithm. 

Samples for setting the reference value will vary depending on the purpose of the assay. 
20 For diagnosis tissue samples may be taken from a "normal" tissue section and a cancer 
from the same individual, but reference samples may also be taken from healthy 
individuals in this context. It is also possible to collect blood samples from healthy 
individuals together with blood samples from individuals, which are known to be suffering 
from colorectal cancer. 

25 

The prognosis of cancer patients is usually determined by the stage of the disease. The 
classification or the staging of the disease can be made using more than one model, but 
the most commonly used classification of colon cancer is based on the tumour morphology. 
This is the so-called Dukes' classification (referring to the original classification described 

30 by Lockhardt-Mummery & Dukes in the 1930 'ies) classifying the disease into three stages 
using the terms Dukes' A-C. Dukes A describes a cancer, where the cancer is limited to the 
lining (mucosa or sub-mucosa) of the colon and has not penetrated the colon. At the 
Dukes' B stage, the cancer has penetrated the muscularis propria and invaded nearby 
organs. Dukes' C is characterised in that a regional metastasis of lymph nodes has 

35 occurred. Later, a commonly used stage "Dukes' D", referring to colorectal cancer with 
distant metastasis to organs like liver, lungs and brain was added to the classification. The 
5-year survival prognosis for colorectal cancer is 80-90% at the Duke's A stage. Patients 
with Duke's B colorectal cancer have 60-70% 5-year survival rate whereas patients with 
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Duke's C colorectal cancer are down to 20-30%. The 5-year survival rate for patients with 
Duke's D colorectal cancer is practically zero (Arenas JW. et al.). 

In a preferred embodiment of the present invention the reference value is indicative of the 
5 stage of colorectal cancer, wherein the stage is selected from the group consisting of 
Duke's A, Duke's B, Duke's C and Duke's D. 

In the present context, the sample is a biological sample. The sample can be selected from 
the group consisting of blood, serum, plasma, faeces, saliva, urine, a cell lysate, a tissue 
10 sample, a biopsy, a tissue lysate, a cell culture, semen, seminal plasma, seminal fluid and 
cerebrospinal fluid. 



In a preferred embodiment of the present invention a protein extract is made from the 
biological sample containing the total protein content including membrane proteins, 

15 nuclear proteins, cytosolic proteins and blood/serum proteins. When the protein extract 
has been established, the protein concentration of the extract is made constant. In the 
present context the term constant refers to that the protein concentration of the sample to 
be analysed should be standardised to a value being the same between different samples 
in order to be able to quantify the signal of the protein markers. Such standardisation 

20 could be made using photometry, spectrometry and gel electrophoresis. 

In a presently preferred embodiment of the present invention, the intensity signal for 
markers 2850 Da, 3570 Da (def 2), 3450 Da (def 1), 3480 Da (def 3), 4270 Da, and/or 
6850 Da, is preferably increased, whereas the intensity signal for markers 9090 Da and/or 
25 12000 Da is preferably decreased. These markers are preferably selected for evaluation of 
the presence of the disease from tissue samples or biopsies. Furthermore, for evaluation of 
the presence of the disease from blood samples, the intensity signal for 5900 Da, 3882 Da, 
and/or 5906 Da, is preferably raised and the intensity signal for 3816 Da, 6436 Da, 13265 
Da, 11133 Da, and/or 13331 is preferably decreased. 



30 



35 



In a presently most preferred embodiment of the present invention, the intensity signal for 
markers 1945 Da and 2210 Da is decreased and the intensity signal for 5906 is increased. 
These markers are preferably selected for evaluation of the presence of the disease from 
blood samples. 

In another presently preferred embodiment of the present invention, the intensity signal 
for markers 1945 Da, 2210 Da, 2230 Da, 2250 Da, 2275 Da, 4300 Da, 4480 Da, and/or 
4500 Da is decreased. These markers are preferably selected for evaluation of the 
presence of the disease from blood samples. 
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In a further presently preferred embodiment of the present invention, the intensity signal 
for marker 5906 Da is raised. This marker is preferably selected for evaluation of the 
presence of the disease from blood samples. 

Also in a presently preferred embodiment of the present invention, the intensity signal for 
marker 1945 Da is decreased. This marker is preferably selected for evaluation of the 
presence of the disease from blood samples. 

10 Also in a presently preferred embodiment of the present invention, the intensity signal for 
marker 2210 Da is decreased. This marker is preferably selected for evaluation of the 
presence of the disease from blood samples. 

One aspect of the present invention provides the use of degradation products of Human 
15 Serum Albumin as marker for cancer. The degradation products are selected from the 

group consisting of the polypeptides having apparent molecular weights of 60500 Da, 6187 
Da, 6090 Da, 5920 Da, 5906 Da, 5901 Da, 5900 Da, and 5333 Da. 

In an embodiment of the present invention the use of at least one polypeptide having 
20 apparent molecular weight of 6187 Da, 5901 Da, or 5333 Da as a marker for cancer is 
provided, wherein at least one of the polypeptides is alpha-fibrinogen protein. In the 
present context the cancer is colorectal cancer. 

In a presently preferred embodiment of the invention, the intensity signal for markers 
25 66800 Da, 66500 Da, 66300 Da, 64860 Da, 46000 Da, 45500 Da, 44300 Da, 28040 Da, 
28025 Da, 28010 Da, 28000 Da, 27700 Da, 15580 Da, 15140 Da, 13700 Da, 13331 Da' 
13265 Da, 12000 Da, 11989 Da, 11133 Da, 9700 Da, 9600 Da, 9197 Da, 9090 Da, 9079 
Da, 8971 Da, 8940 Da, 8931 Da, 8652 Da, 8580 Da, 8230 Da, 7324 Da, 7023 Da, 6880 
Da, 6660 Da, 6650 Da, 6635 Da, 6450 Da, 6436 Da, 6435 Da,6430 Da, 5360 Da, 5075 
30 Da, 4749 Da, 4660 Da, 4640 Da, 4634 Da, 4500 Da, 4480 Da, 4330 Da, 4300 Da, 4290 
Da, 4168 Da, 4000 Da, 3984 Da, 3980 Da , 3960 Da, 3816 Da, 3777 Da, 3680 Da, 3280 
Da, 3160 Da, 2450 Da, 2330 Da, 2275 Da, 2230 Da, 2210, 1945 Da, 1930 Da 1536 Da, 
1365 Da, 1256 Da, 1042 Da, 1026 Da, and 1005 Da is increased and the intensity signal 
for markers 66500 Da, 46000 Da, 45500 Da, 44300 Da, 28040 Da, 27700 Da, 15580 Da, 
35 15140 Da, 13700 Da, 13331 Da 13265 Da, 12000 Da, 11989 Da, 11133 Da, 9700 Da, 
9600 Da, 9197 Da, 9090 Da, 9079 Da, 8971 Da, 8940 Da, 8652 Da, 8580 Da, 8230 Da, 
7324 Da, 7023 Da, 6880 Da, 6660 Da, 6650 Da, 6450 Da, 6436 Da, 6430 Da, 5360 Da, 
5075 Da, 4749 Da, 4660 Da, 4640 Da, 4634 Da, 4500 Da, 4480 Da, 4330 Da, 4300 Da,' 
4290 Da, 4168 Da, 4000 Da, 3984 Da, 3980 Da , 3960 Da, 3816 Da, 3777 Da, 3680 Da, 
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3280 Da, 3160 Da, 2450 Da, 2330 Da, 2275 Da, 2230 Da, 2210, 1945 Da, 1930 Da 1536 
Da, 1365 Da, 1256 Da, 1042 Da, 1026 Da, and 1005 Da is decreased. 

In an embodiment of the present invention the intensity signal for markers 60500 Da 
5 19900 Da, 11080 Da, 10830 Da, 9140 Da, 8930 Da, 6110 Da, 6090 Da, 5920 Da, 5900 
Da, 5540 Da, 5330 Da, 5260 Da, 4460 Da, and 2960 Da is increased and the intensity 
signal for markers 66500 Da, 44300 Da, 28040 Da, 27700 Da, 15580 Da, 13700 Da, 6880 
Da, 6660 Da, 6430 Da, 4660 Da, 4640 Da, 4330 Da, 4300 Da, 4290 Da, 4000 Da, 3980 
Da , 3960 Da, 3680 Da, 3280 Da, and 3160 Da is decreased when assaying a serum 
10 sample on IMAC30 chip (Ciphergen). 

In an embodiment of the present invention the intensity signal for markers 11900 Da, 
11700 Da, 11650 Da, 11550 Da, and 11500 Da is increased and the intensity signal for 
markers 46000 Da, 45500 Da, 8940 Da, 8230 Da, 6650 Da, and 6450 Da is decreased 
15 when assaying a serum sample on H50 protein chip. 

In an embodiment of the present invention the intensity signal for markers 15200 Da, 
6125 Da, 5900 Da, 3275 Da, and 2955 Da is increased and the intensity signal for markers 
4290 Da, 2450 Da, 1536 Da is decreased when assaying a serum sample on CM10 protein 
20 chip. 

In an embodiment of the present invention the intensity signal for markers 33000 Da, 
16150 Da, 15935 Da, and 15200 Da is increased when assaying a serum sample on ' 
Sax2protein chip. 



25 



in an embodiment of the present invention the intensity signal for markers 5857 Da, 4264 
Da, 3878 Da, 3712 Da, 3651 Da, 3574 Da, 3487 Da, 3444 Da, 3372 Da, and 1688 Da is 
increased and the intensity signal for markers 9700 Da, 8652 Da, 8652 Da, 8580 Da, 7023 
Da, 5360 Da, 4168 Da, 1365 Da, 1256 Da, 1042 Da, 1026 Da, and 1005 Da is decreased 
30 when assaying a tissue sample on NP20 protein chip. 

In an embodiment of the present invention the intensity signal for markers 11987 Da, 
5871 Da, 5234 Da, 4281 Da, 4266 Da, 4039 Da, 4024 Da, 3408 Da, 2933 Da, 2878 Da 
2840 Da, 2799 Da, 2693 Da, 2462 Da, and 2364 Da is increased and the intensity signal 
35 for 15140 Da, 11989 Da, 9600 Da, 9197 Da, 9079 Da, 8971 Da, 7324 Da, 5075 Da, 4749 
Da, 4634 Da, 3984 Da, 3777 Da, 2330 Da, and 1930 Da is decreased when assaying a 
tissue sample on Sax2protein chip. 
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In a presently preferred embodiment of the invention the intensity signa. for markers 5340 
Da and 5906 Da is increased and the intensity signal for 3980 Da, 6880 Da, and 28010 is 
decreased when assaying a serum sample on IMac30 chip. 

5 in the present context, the term "plasma sample" relates to a samp.e wherein a blood 
sample is tapped into "EDTA-.iquid-glass", centrifuged and where the supernatant is 
optionally frozen immediately at -80°C. 

in the present context, the term "serum samp.e" re.ates to a sample wherein a blood 
10 samp»e is tapped into a dry-glass, .eft to coagulate at room temperature for one hour 
after which they are centrifuged and the supernatant is optionally frozen immediate.y'at - 

80 C. 

in the present context, the term "increased" in relation to the term "intensity signal" for a 
15 marker, refers to a comparison of an intensity signal from a samp.e to a reference value 
wherein the samples have been normalized to ion noise or "housekeeping genes". The ' 
mtensity signa. for a specific marker, having a certain size, weight, number of nucleotides 
or amino acids, is "increased" if it is higher in the samp.e as compared to the reference 
^ va.ue. If the term "raised" is used this is to be interpreted to a.so mean "increased". 

in the present context, the term "decreased" in relation to the term "intensity signa." for a 
marker, refers to a comparison of an intensity signal from a sample to a reference value 
wherein the samples have been normalized to ion noise or "housekeeping genes" The ' 
intensity signa. for a specific marker, having a certain size, weight, number of nucleotides 
25 or ammo adds, is "decreased" if it is lower in the samp.e as compared to the reference 
value. ^ ^ 



In one aspect of the present invention a method is provided for determining the presence 
of co orecta. cancer on the basis of a samp.e from a mamma.. The method comprises 
30 se.ect.ng a normalized protein expression data set from the sample, wherein the 

expression data set comprises a p.urality of expression intensities of proteins on at least 
one protein chip. Thereafter, at least one marker is selected from the normalized protein 
expression data set from the group consisting of the polypeptides having apparent 
molecular weight of: 



35 



66500 Da, 60500 Da, 46000 Da, 45500 Da, 44300 Da, 33000 Da, 28040 Da, 27700 Da 

HZ T 15935 15580 ^ 15200 Da ' 15140 Da ' 13700 Da ' ^3331 Da,' 

3265 Da, 12000 Da 11989 Da, 11987 Da, 11900 Da, 11700 Da, 11650 Da, 11550 Da, 

11500 Da, 11133 Da, 11080 Da, 10830 Da, 9700 Da, 9600 Da, 9197 Da, 9140 Da, 9090 
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Da, 9079 Da, 8971 Da, 8940 Da, 8930 Da, 8652 Da, 8580 Da, 8230 Da, 7324 Da, 7023 
Da, 6880 Da, 6850 Da, 6660 Da, 6650 Da, 6450 Da, 6436 Da, 6430 Da, 6125 Da, 6110 
Da, 6090 Da, 5920 Da, 5906 Da, 5900 Da, 5871 Da, 5857 Da, 5540 Da, 5360 Da, 5330 
Da, 5260 Da, 5234 Da, 5075 Da, 4749 Da, 4660 Da, 4640 Da, 4634 Da, 4500 Da, 4480 
5 Da, 4460 Da, 4330 Da, 4300 Da, 4290 Da, 4281 Da, 4270 Da, 4266 Da, 4264 Da, 4168 
Da, 4039 Da, 4024 Da, 4000 Da, 3984 Da, 3980 Da, 3960 Da, 3882 Da, 3878 Da, 3816 
Da, 3777 Da, 3712 Da, 3680 Da, 3651 Da, 3574 Da, 3570 Da (def 2), 3487 Da, 3480 Da 
(def 3),3450 Da (def 1),3444 Da, 3408 Da, 3372 Da, 3280, 3275 Da, Da, 3160, Da, 2960 
Da, 2955 Da, 2933 Da, 2878 Da, 2850 Da, 2840 Da, 2799 Da, 2693 Da, 2462 Da, 2450 

10 Da, 2364 Da, 2330 Da, 2275 Da, 2230 Da, 2210 Da, 1945 Da,1930 Da, 1688 Da, 1536 
Da, 1365 Da, 1256 Da, 1042 Da, 1026 Da, and 1005 Da. Thereafter the weight for said at 
least one marker is set and the intensities of said at least one marker is/are multiplied with 
the weight of said at least one marker. If the markers are more than one the sum of the 
multiplication obtained above is calculated and that sum value is compared with a cut off 

15 value (as explained in example 7). 

In the present context the weight for each marker is set by assigning a number between - 
0.9 and +0.9 to each marker. The exact number (between -0.9 and +0.9) fs selected as 
the number that results in the highest combination of a sensitivity and specificity value. 
20 This can be tested as shown in table 15 in example 7. 

In a presently preferred embodiment the determination is based on the following 
algorithm: 



25 



Give the selected markers weights between -0.9 and 0.9, i.e. marker A, weight a, 
marker B, weight b, marker C, weight c, marker D, weight d and marker N, weight 



get intensities of A, B, C, D,..N markers in the following order: A m/z, B m/z, C 
30 m/z, D m/z, N .. m/z; 

multiply the first intensity with weight a; 
multiply the second intensity with weight b; 
multiply the third intensity with weight c; 
35 multiply the fourth intensity with weight d; 

multiply the n intensities with weight n; and 
calculate the sum of the above multiplications. 

If sum lower than cutoff value => sample is negative for colon cancer. 
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If sum higher than cutoff value => sample is positive for colon cancer. 

In another aspect of the present invention a computer system for monitoring 
colorectal cancer in a mammal is provided. The computer system comprises a storage 
means for electronically storing data, processing means for storing input data from a 
mass spectrometer, input means for interfacing between an mass spectrometer and 
the computer system, and an interface between a user and the computer system, 
wherein the processing means determines the likelihood of colorectal cancer by 
applying the following algorithm: 

Give the selected markers weights between -0.9 and 0.9, i.e. marker A, weight a, 
marker B, weight b, marker C, weight c, marker D, weight d and marker N, weight n; 

get intensities of A, B, C, D,..N markers in the following order: A m/z, B m/z, C m/z, D 
15 m/z, N .. m/z; 

multiply the first intensity with weight a; 
multiply the second intensity with weight b; 
multiply the third intensity with weight c; 
20 multiply the fourth intensity with weight d; 
multiply the n intensities with weight n; and 
calculate the sum of the above multiplications. 

If sum lower than cutoff value => sample is negative for colon cancer. 
25 If sum higher than cutoff value => sample is positive for colon cancer. 

In the present context, the term "cutoff" in relation to the program refers to a value for 
classification. The predicted grouping of a sample is classified as positive for colon cancer if 
it is above the cutoff value and negative for colon cancer if it is below the cutoff value. 



30 



In mass spectrometry the measured mass is given i Daltons (Da) or m/z. Dalton is a 
weight unit, wherein m/z relates to mass over charge (mass/charge). In the present 
context there is no difference between Daltons (Da) or m/z. 

35 In the present context, the term "storage means" relates to hard disk, DVD disk, CD disk 
or floppy diskettes for storing digital data. 

In the present context, the term "processing means" relates to a computer comprising a 
processor, RAM memory, etc... 
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In the present context, the term "interface between a user and the computer system- 
relates to keyboard, computer mouse, and a monitor. 

in one aspect of the present invention a kit for diagnosis of colorectal cancer is provided, 
the kit comprising: a first antibody including a portion bound to a solid phase and a region 
which specifically binds to alpha-fetoprotein, a second antibody including a region which 
specifically binds to alpha-fetoprotein and a portion which has a label, and optionally a 
reference protein. 



in another aspect of the present invention a kit for diagnosis of colorectal cancer is 
provided, the kit comprising: a first antibody including a portion bound to a solid phase 
and a region which specifically binds to alpha-fibrinogen, a second antibody including a 
region which specifically binds to alpha-fibrinogen and a portion which has a label, and 
15 optionally a reference protein. 

in yet another aspect of the present invention a kit for diagnosis of colorectal cancer is 
provided, the kit comprising: a first antibody including a portion bound to a solid phase 
and a region which specifically binds to human serum albumin (HSA) or fragments of HSA 
20 a second antibody including a region which specifically binds to human serum albumin 
(HSA) or fragments of HSA and a portion which has a label, and optionally a reference 
protein. 



in an embodiment of the present invention the kit for diagnosis of colorectal cancer may 
25 comprise components to detect one or more of the proteins alpha-fetoprotein, alpha- 
fibrinogen and human serum albumin (HSA). The antibodies may recognise epitopes which 
are only exposed when the protein is degraded. 

In the present context the term "epitope" relates to a certain area on the surface of the 
30 protein comprising a number of amino acids. 

Several mutations in oncogenes and tumour-suppresser genes have been identified in 
colorectal cancer. The majority of these genes are associated with certain phases of the 
disease. A mutation in the tumour-suppresser gene Adenomatous Polyposis Coli gene 
35 (APC), is considered to be a molecular -gatekeeper" for development of adenomas and it 
has been estimated that over 8 0 o/ o of all colorectal cancers have a somatic mutation in the 
APC gene. There are actually very few oncogenes, which have been shown to be involved 
with colorectal cancers apart from k-ras, but a small percentage of colorectal cancers show 
mutat.ons in the myc, myb and neu oncogenes. A mutation in k-ras is considered to be an 
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intermediate event in colorectal carcinogenesis advancing the disease from early adenoma 
to intermediate adenoma. Several other products of tumour-suppresser genes have also 
been associated with colorectal cancer, many of those genes are located on the long arm 
of chromosome 18. Allelic loss on 18q has been associated with the DCC gene (deleted in 
colorectal cancer), MADR2 gene (also known as JV18) and DPC4 gene (deleted in 
pancreatic cancer), the last two are players In the TGF-beta signalling pathway. It has 
been proposed that DCC, DPC4 and MADR2 play a role in the progression over to late 
adenoma (Gryfe R et al.). 

One of the best known and studied tumour-suppresser genes, p53, is associated with 
driving the disease towards carcinoma. The product of the gene, which is located on 
chromosome 17, is a nuclear protein and has a function in cell cycle regulation, but a loss 
of heterozygocity on 17p has been demonstrated in over 70% of all colorectal cancers. 

In a preferred embodiment of the present invention, the detection method using at least 
one of the novel protein markers for the detection of colorectal cancer could be 
supplemented with the detection of one or more protein markers selected from the group 
consisting of APC, k-ras, myc, myb, neu, DCC, DPC4, MADR2, p53, BCMP, CJA8, CZA8, 
BCX2, CBC2, CBC1, CBC3, CJA9, CGA7, BCN5, CQA1, BCN7, CQA2, CGA8, CAA7, CAA9, 
PKC isozyme, bcl-2, bax, TIMP-1 and c-myc. 

Figure legends 

Figure 1. 

Average intensity values of markers of colorectal cancer. Tissue samples from 12 cancer 
patients including a normal tissue sample and cancer tissue sample from the same 
individual were homogenised and protein extracts were analysed by mass-spectrometry 
using SAX2 chips and the SELDI-TOF technique. The figure shows the intensity levels of 
the markers selected based on highest sensitivity and specificity. 

Figure 2. 

Discriminating values calculated for 8 markers. The average intensity value for each 
marker was calculated for normal and cancer tissue sample sets, after removing the 
highest and lowest values. The discriminating value for each marker was found by dividing 
the average intensities from each of the sample sets. 

Figure 3. 

Average intensity values of possible markers in serum. Serum samples from 10 cancer 
patients and 10 healthy individuals were analysed by mass-spectrometry using IMAC3 
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chips and the SELDI-TOF technique. The figure shows the intensity levels of the markers 
selected based on highest intensity. 



Figure 4 
5 Serum marker: 1945 Da. 
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Threshold value: 8.9 (maximum value for cancer serum) 

* 

12 out of 78 normal serum samples fall below threshold, producing a specificity of 85%. 

10 Figure 5 

Serum marker 2210 Da 
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Threshold value: 12.7 (maximum value for cancer serum) 

18 out of 78 normal serum samples fall below threshold, producing a specificity of 77%. 



Figure 6 

Serum marker 2230 Da 
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Threshold value: 5.6 (maximum value for cancer serum) 
20 18 out of 78 normal serum samples fall below threshold, producing a specificity of 77%. 

Figure 7 

Serum marker 2250 Da 
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Signal intensity 


• 

Cancer 


Normal 


mid 


1.204193541 


7.006661 


max 


3.640628662 


20.46203 


min 


0.234108032 


0.550792 



Threshold value: 3.6 (maximum value for cancer serum) 

♦ 

22 out of 78 normal serum samples fall below threshold, producing a specificity of 72%. 



5 Figure 8 

Serum marker 2275 Da 
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Threshold value: 3.1 (maximum value for cancer serum) 

30 out of 78 cancer serum samples fall below threshold, producing a specificity of 62%, 
Figure 9 

Serum marker 4300 Da 
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10.52571 


mm 


0.029092626 


0.225152 



Threshold value: 1.1 (maximum value for cancer serum) 
15 20 out of 78 cancer serum samples fall below threshold, producing a specificity of 74%. 



Figure 10 

Serum marker 4475 Da 



Signal intensity 


Cancer 


Normal 


mid 


0.828595247 


3.363255 


max 


2.067939342 


7.826388 


mm 


0.035968835 


0.900171 
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Threshold value: 2.1 (maximum value for cancer serum) 

20 out of 78 cancer serum samples are below threshold, producing a specificity of 74%. 



Figure 11 
5 Serum marker 4500 Da 



Signal intensity 


Cancer 


Normal 


mid 


0.821256006 


3.360526 


max 


2.067939342 


7.826388 


min 


0.035968835 


0.889889 



Threshold value: 2.1 (maximum value for cancer serum) 

20 out of 78 cancer serum samples are below threshold, producing a specificity of 74%. 

10 Figure 12 

Serum marker 5.9 Da. 



Signal intensity 


Cancer 


Normal 


middle 


5.088206618 


1.413438 


max 


13.43115416 


5.412548 


min 


0.638267678 


0.182963 



Threshold value: 5.4 (maximum value for normal serum) 

49 out of 78 cancer serum samples fall below threshold, producing a specificity of 37%. 
Figure 13 

Peptide pattern in the region from 1900 to 2500 Da. 



Figure 14 

20 Mass spectra from a same sample analysed by the SELDI TOF technique (A) and the 
MALDI-TOF technique (B) 

Figure 15 

A scatter-plot of the sample scores and variable loading of a data set comprising data from 
25 healthy individuals and individuals diagnosed with colon cancer. 
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Figure 16 

A and B. Representative SELDI-TOF/MS spectra of normal colon tissue (A) on NP20 chip 
and normal serum (B) on iMAC30 chip. The two spectra differ significantly and each 
produce a total of 40 to 60 peaks, the majority of which lie in the specified range from 2 to 
5 10 kDa. 

C. Comparison of typical colon tumour spectrum (above) and normal colon spectrum 
(below) in the range from 3 to 4 kDa. The arrows point to the three differentially 
expressed peptides, subsequently identified as HNP 1-3. The three peptides are expressed 
in both the normal colon samples and the colon tumour samples, but the expression is up- 
10 regulated in the cancer samples. The same observation was made in the serum screening, 
but here the average signal intensity was significantly lower. 



Figure 17 

A. HNP profiles of normal and colon tumour tissue. 40 colon tumour and 40 normal colon 
15 tissue samples were analysed on NP20 chips. Differences in mean intensities of HNP1-3 in 

normal and colon tumour tissue are statistical significant at 5% level (p<0.0005). 

B. HNP profiles of normal and colon cancer serum. Serum samples (125 colon cancer and 
100 normal) were analysed on iMAC30 chips. The mean intensities are significantly 
different at 5% level (p<2.2e-16). The box-plot shows the 25th quintile, median, 75th 

20 quantile, and whiskers extend to min. and max. values. 



Figure 18 

Protein extract from tumour tissue was separated on a peptide gel-filtration column. The 
elution volumes of forty (unidentified) peptides is plotted against their respective mass 
25 values and an approximate elution curve is calculated. The arrows point to HNP 1-3, which 
are eluted in two fractions: in the void volume (8ml) together with High Mass proteins 
(above 20 kDa) and after 14ml together with peptides of similar mass range (2-4kDa). We 
interpret this as evidence for binding between HNP 1-3 and High Mass proteins. 

30 Figure 19 

Normal microscopy (A&B) and fluorescence microscopy (C&D) of MDCK cells. MDCK cells 
were exposed to calcein with (A&C) and without HNP 1-3 (B&D). By fluorescence 
microscopy (C&D) the cells were observed to uptake calcein only when treated with 
fractions containing HNP l-3/calcein(C). Fractions containing other peptides (unidentified 
35 peptides also purified from colon tumours) were used as negative controls together with 
calcein and did not stimulate the cells to uptake calcein (D) Also, cell islands treated with 
HNP 1-3 appeared diffuse and showed enlarged nuclei, indicating apoptosis (A). 
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Figure 20 

A-E shows the average intensity spectra of healthy individuals (solid) and patients 
diagnosed with colon cancer (dashed). The standard errors of means (SEM) are shown 
5 with bars. 



Examples 

EXAMPLE 1 IDENTIFICATION OF BIOMARKERS FOR COLORECTAL CANCER BY TISSUE 
INVESTIGATIONS 



10 



The aim of the study was to identify protein markers indicative of colorectal cancer by 
comparison of norma! and cancer tissue from colon and rectum. 

Method 

15 Sample preparation 

Samples from 12 cancer patients were collected. Normal tissue samples and cancer tissue 
samples from the same colon were taken and frozen at -80°C. Prior to analysis the 
samples were taken out of the freezer and placed into homogenisation/Lysis buffer. 

20 Lysis buffer: 

100 mM TRIS, pH 8.0 
9.5 M UREA 
1% CHAPS. 

25 The samples were homogenised in a Wheaton Overhead Stirrer for 2 minutes at speed 
step 2. 



30 



Analysis 

Protein extracts were analysed by mass-spectrometry using the SELDI-TOF technique. 
SAX2 chips were pre-treated with 50 pi 100 mM TRIS pH 8.0 buffer. 



10 pi homogenised sample + 60 pi TRIS pH 8.0 buffer were mixed and incubated on SAX2 
Chip in a Bioprocessor for 30 minutes at room temperature. Thereafter spots were washed 
35 twice in 250 pi 100 mM TRIS pH 8.0 for 5 minutes. 
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2 times 0.5 p! Matrix (CHCA) was applied onto spot surface. 
Instrument settings 

Proteinchips were analysed at Laser intensities of 190, 210, and 230, and the sensitivity 
5 level was set at 8. 



10 



Results 

Putative markers were identified by visual examination of the mass spectra from cancer 
and normal samples. 

Table 1. Mass values of proteins showing increased expression in cancer tissue: 



Laser Intensity 


190 


210 


230 


2305 Da 


2305 Da 


2305 Da 




2460 Da 


2460 Da 




2840 Da 




2850 Da 


2850 Da 


2850 Da 


2991 Da 






3370 Da 


3370 Da 




3440 Da 


3440 Da 




3480 Da 


3480 Da 






4275 Da 


4275 Da 






6850 Da 



Table 2. Mass values of proteins showing decreased expression in cancer tissue 



Laser Intensity 


190 


210 | 230 


1925 Da 








• 


1940 Da 






5000 Da 






6190 Da 




6375 Da 






6575 Da 








6590 Da 






7570 Da 






8410 Da 






8700 Da 
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9090 Da 






11670 Da 






12000 Da 



Possible markers: 

In order to the determine the specificity and sensitivity of the possible markers all spectres 
were normalised based on total ion current. 

5 

Table 3. Specificity and sensitivity of protein markers showing increased expression in 
cancer tissue: 



Size (Da) 


Specificity (°/o) 


Sensitivity (%) 


2300 


83 


66 


2460 


75 


83 


2850 


100 


92 


2840 


66 


92 


2990 


75 


50 


3370 


75 


83 


3450 


83 


83 


3480 


83 


92 


4270 


92 


92 


6850 


91 


92 



Table 4. Specificity and sensitivity of protein markers showing decreased expression in 
10 cancer tissue. 



Size (Da) 


Specif icity(°/o) 


Sensitivity (%) 


1920 


75 


50 


1940 


67 


25 


5000 


50 


50 


6190 


83 


75 


6375 


67 


100 


6575 


58 


58 


7590 


83 


50 


8410 


58 


42 


8700 


66 


58 


9090 


83 


83 


11670 


83 


50 


12000 


83 


83 
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Possible multi-protein marker: 

Based on values of sensitivity and specificity the most promising single protein markers 
were selected: 



5 Table 5. Protein markers showing increased expression in cancer tissue 



Size (Da) 


Specificity(°/o) 


Sensitivity (°/o) 


Identification 


2850 


100 


92 


N.D. 


3370 


75 


83 


Alfa-Defensin-2 


3450 


83 


83 


Aifa-Defensin-1 


3480 


83 


92 


Alfa Defensin-3 


4270 


92 


92 


N.D. 


6850 


92 

1 


92 


N.D. 



Table 6. Protein markers showing decreased exp ression in cancer tissue. 



Size (Da) 


Specificity(°/o) 


Sensitivity (%) 


Identification 


9090 


83 


83 


N.D. 


12000 


83 


83 


N.D. 



10 Conclusion 

Eight promising single protein markers were found using the SELDI-TOF mass- 
spectrometry technique and applying samples on protein-chips. Three of the markers have 
been fully identified as Alpha-Defensin 1, 2, and 3. A multi-protein marker based on a 
combination of one or more of the eight proteins shown above appears to be a very 
15 effective way of screening for colorectal cancer. 

EXAMPLE 2 IDENTIFICATION OF BIOMARKERS FOR COLORECTAL CANCER IN SERUM 

The aim of the study was to identify protein markers indicative of colorectal cancer by 
20 comparison of serum samples from normai and cancer patients. 

Method 

Sample preparation 

Serum was isolated from blood of 10 patients diagnosed as having colorectal cancer and 
25 10 healthy individuals. 
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Analysis 

An IMAC3 chip was pre-treated with 2 times 5 pi lOOmM NiS04 followed by wash with 5 pi 
MQ water and equilibration with 2 times 5 pi binding buffer. 

5 Binding buffer: 

lOOmM TRIS HCI, pH 7.5 
500mM NaCI 
0.1% Triton X-100 

10 2 pi of each serum sample was diluted in 48 pi binding buffer of which 4 pi was applied to 
the protein chip surface. The chip was left on shaker at room temperature for 40 minutes. 
The sample was removed from the chip surface and each spot was washed with 3 times 5 
pi washing buffer (PBS, pH 7.4, 700mM NaCI). Finally the chip was air-dried and 2 times 
0.6 pi CHCA (100%) was applied to each spot. 



15 



Protein extracts were analysed by mass-spectrometry using the SELDI-TOF technique. 



Instrument settings 

Protein-chips were analysed at varying laser intensities and sensitivity levels to obtain 
20 optimal spectra. 



Resufts 

Sensitivity and specificity of putative serum markers: 

25 Table 7. Protein markers showing increased eypr^inn m serum samples of cancer 
patients: 



Size (Da) 


Specif icity(°/o) 


Sensitivity (%) 


Identification 


5905 


70 


70 

— 


N.D. 


5899 


~70 " 




N.D. 


5928 


70 


70 


N.D. 


3882 


~60 


~60 


N.D. 


Table 8. Protein markers showina decreased expr^nn in comn, ^mpl^ nf rnnccr 
patients. 


Size (Da) 


Specificity(%) 


Sensitivity (%) 


Identification 


3816 


60 


60 


N.D. 
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6435 


60 


60 


N.D. 


13265 


60 


60 


N.D. 


11132 


50 


50 


N.D. 


13331 


50 


50 


N.D. 



Conclusion 

Eight possible single protein markers were found using the SELDI-TOF mass-spectrometry 
technique and applying serum samples on protein-chips. None of the markers have been 
5 fully identified and annotated. A multi-protein marker based on a combination of one or 
more of the eight proteins shown above appears to be a very effective way for diagnosis of 
colorectal cancer. 



EXAMPLE 3 SERUM SCREENING 

1 0 Materials and method 
Chip: 

Serum samples were analysed on IMAC3 chip (Ciphergen). 
Pre treatment: 

Each spot is outlined with hydro pen. 
15 5pl 100 mM NiS04 is added, shake (150rpm) 1 min. Remove. Repeat once. 
5pl MQ water is added shake 1 min. Remove. 
5pl Bind buffer is added shake lmin. Remove. 



Binding step : 
20 Chip is placed in Bioprocessor. 

50 pi binding buffer + 5^1 serum is mixed in eppendorf tube, solution is loaded in 
bioprocessor. Leave on shaker (250 rpm) for 40 min. Remove. 

Washing step: 

25 200 ill washing buffer is added. Shake (250 rpm) 5min. Remove. Repeat once. 
Drv step: 

Chips are removed from bioprocessor and left to air dry for 20 minutes. 

30 Crystallisat ion step : 

0.6 jil matrix solution is added to each spot. Air dry chip for 5 min. 
Repeat once. 
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Analysis: 

Chips are analysed on PBS II instrument (Ciphergen) at laser intensity 210 and detector 

« 

sensitivity 4. 

5 Results 

Biomarker Wizard analysis 

78 colon cancer serum and 78 normal serum samples were analysed as described above. 
All spectra were pooled and normalised based on total ion current. 

10 

Possible markers were identified by Biomarker Wizard (Ciphergen) analysis with the 
following parameter settings: 

First pass: 5, Min peak threshold: 0%, Cluster mass window: 03 %, Second pass: 5. 
15 Based on the results from the Biomarker Wizard 9 peptides showed promising marker 
characteristics. 



Mass values of possible serum marker peptides: 

20 

Down-regulated in colon cancer serum: 
1945, 2210, 2230, 225Q, 2275, 4300, 4480, 4500 Da. 
25 Up-regulated in colon cancer serum: 
5906 Da. 

Threshold va lues for possible serum markers 
30 Optimal threshold values for the 9 serum markers were selected in order to determine 
maximum specificity of individual markers: 



Marker (Da) 


Specificity (°/o) 


1945 


85 


2210 


77 


2230 


77 


2250 


72 
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2275 


62 


4300 


74 


4480 


74 


4500 


74 


5906 


37 



Principal component analysis 
. Based on principal component analysis of a sample set of 38 cancer serum and 31 normal 

serum, it was shown that especially three markers were of high importance for 
5 discriminating between cancer and normal serum. 

Conclusion 

Especially important markers: 1945 Da, 2210 Da, and 5906 Da. 

10 EXAMPLE 4 USE OF SELDI-TOF/MS OR MALDI-TOF/MS FOR DETECTION OF BIOMARKERS 
FOR COLORECTAL CANCER. 

The aim of this study was to compare the outcome of markers detected with different 
expression of proteins in healthy individuals vs. patients diagnosed with colorectal cancer, 
using either SELDI-TOF/MS or an MALDI-TOF/MS. 

15 

Method 

The PBS II instrument allows variation of three important parameters when analysing 

protein chips or MALDI-TOF/MS samples. 

Laser intensity, detector sensitivity and optimisation range. 

20 

Laser intensity was permanently set at 220. However, since the laser source is constantly 
becoming weaker as the instrument is being used, and varies significantly from instrument 
to instrument, this is not a value that has any general meaning. Most often values from 
190 to 230 are chosen. 

25 

Detector sensitivity was set at values of 3,4,5,6,7,8 depending on the signal. The intensity 
(and only the intensity, not the protein profile) of the sample is highly dependent on the 
matrix solution which is made immediately prior each screening. The detector sensitivity 
value is chosen such that none of the protein peaks will ever produce a signal that 
30 overrides the maximum limit. Thus the appropriate detector value will depend on the 
specific matrix solution, and thus has no general meaning. 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/090550 



PCT/DK2004/000263 



34 



Optimisation range, this range specifies the mass interval where the instrument will 
measure the signal with highest accuracy. For each screening we made two 
measurements. One with low optimisation range (m/z 2000-20000) and one with high 
(m/z 20000-150000) The identified markers below m/z 20000 were all measured in the 
5 low screening and the markers above m/z 20000 were all measured in the high screening 

Protein chips were analysed on the PBS II SELDI instrument (Ciphergen). 

SPA (Sinapinic Acid) matrix was used in the crystallisation step in all screenings: 

10 SPA (Ciphergen) was dissolved in 150 pi MQ + 150pl Acetonitrile + 1,5 pi TFA (tri-flouro- 
acetic-acid) and left on shaker for 10 minutes and centrifuged at 14.000 rpm for 15 
minutes. 

Analysis 

Mass spectra from serum samples of healthy individuals and patients diagnosed with 
15 colorectal cancer were analysed for potential markers. 

An analysis of a serum sample by SELDI-TOF/MS indicated a protein marker of m/z 5900. 
The same sample was prepared for MALDI-TOF/MS analysis by removing salt and lipids 
from serum by gel-filtration. The results shown in figure 14 disclose this same protein with 
the same mass value as the SELDI-TOF/MS analysis did. 



20 



EXAMPLE 5 IDENTIFICATION OF SERUM MARKERS USING SEVERAL TYPES OF PROTEIN 
CHIPS 

The aim of this study was to analyse the effect of using different protein chips in 
differential protein expression analysis using SELDI mass spectrometry. 

25 Materials and methods 
Samples 

The IMAC study was based on analysis of serum from 12 cancer patients and 35 healthy 
individuals. The other studies (CM10, H50, and SAX 2) were based on studies of analysis of 
serum from 8 cancer patients and 8 healthy individuals. 
30 Cancer serum samples were obtained from cancer patients prior to surgery. Normal serum 
was obtained from a group of healthy individuals matched by age and gender to the cancer 
patients. Serum samples were stored at -80°C until use. Samples were assayed by the 
SELDI-TOF/MS technique (Ciphergen). 



35 Sample preparation 

Samples were pre-treated by applying 5pl of pre-treatment solution to the chip surface 
and the chip was left on shaker for 5 minutes. The pre-treatment solution varies for 
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different chip types. This process was repeated twice. The chip was washed in MQ-water 
twice and once in binding buffer. 

Serum sampies were thawed on ice and 5 nl serum was diluted in 50 H l binding buffer and 
5 left on shaker for 40 minutes. Next the samples were removed and chips were washed 
twice in washing buffer, followed by wash in MQ-water. 

Chips were left to dry at room temp for 20 minutes. 0.6 »\ crystallisation solution was 
applied twice. 



10 Analysis 

The PBS II instrument (Ciphergen) was calibrated prior to use and chips were analysed 
with detector sensitivity and laser intensity at suitable values. 

Data mining: 

All spectra were pooled into one experiment file and were normalised based on total ion 
15 current. Markers were identified by the Biomarker Wizard software (Ciphergen) and 
markers were compared and combined by principal component analysis 

Description of chips used for serum screening. 

As described, the protein chip surfaces are composed of common chromatographic resins 
20 commonly used in other purification techniques: 

■ 

IMAC3Q ProteinChip Array 

The IMAC30 ProteinChip Array is an immobilised metal affinity capture array with a 
nitriloacetic acid (NTA) surface. The IMAC30 ProteinChip Array is activated with transition 
25 metals prior to use. 

CM1 0 ProteinChip Array 

The CM10 ProteinChip Arrays incorporate carboxylate chemistry (negatively charged) that 
acts as a weak cation exchanger. 



30 



35 



H50 ProteinChip Army 

H50 ProteinChip Arrays bind proteins through reversed phase or hydrophobic interaction 
chromatography and have binding characteristics similar to that of a C6 to C12 alkyl 
chromatographic resin. 

SAX2 ProteinChip Array 

The SAX2 ProteinChip Array is a strong anion exchange array with quaternary amine 
functionality. 
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Description of buffers used for binding and washing steps in the serum screening 



The buffer solutions used, are common buffers used in other purification techniques; 



5 IMAC30 screening 
Pre-treatment : 
Binding buffer : 
Washing buffer: 

CM10 screening 
10 Pre-treatment : 
Binding buffer : 
Washing buffer: 

H50 screening 
Pre-treatment : 
15 Binding buffer : 
Washing buffer: 

SAX2 screening 
Pre-treatment : 
Binding buffer : 
20 Washing buffer: 

Resuits 



100 mM NiS04 

100 mM TRIS, pH 7.5; 500 mM NaCI; 0.1% Triton X-100 
PBS, pH 7.5; 700 mM NaCI 



None 

50 mM TRIS, pH 7.5 
50 mM TRIS, pH 7.5 



100% acetonitnie 

PBS, pH 7.4; 10% ACN; 250 mM NaCI 
PBS, pH 7.4; 10% ACN; 250 mM NaCI 

None 

50 mM TRIS, pH 8.0; 0.1% Triton X-100 
50 mM TRIS, pH 8.0; 0.1% Triton X-100 



Table 10 Possible markers detected using different protein chips. 



Chip 


Up-regulated 


Down-regulated 


H50 


11900 Da, 11700 Da, 11650 Da, 
11550 Da, 11500 Da 


46000 Da, 45500 Da, 8940 Da, 
8230 Da, 6650 Da, 6450 Da 


CMIO 


15200 Da, 6125 Da, 5900 Da, 
3275 Da, 2955 Da 


4290 Da, 2450 Da, 1536 Da 


SAX2 


33000 Da, 16150 Da 
15935 Da, 15200 Da 




IMAC30 


60500 Da, 19900 Da, 11080 Da, 
10830 Da, 9140 Da, 8930 Da , 
6110 Da, 6090 Da, 5920 Da, 
5900 Da, 5540 Da, 5330 Da, 
5260 Da, 4460 Da, 2960 Da 


66500 Da, 44300 Da, 28121 Da, 28010 
Da, 28315 Da, 27700 Da, 15580 Da, 
13700 Da, 6880 Da, 6660 Da, 6430 Da, 
4660 Da, 4640 Da, 4330 Da, 4300 Da, 
4290 Da, 4000 Da, 3980 Da , 3960 Da, 
3680 Da, 3280 Da, 3160 Da 
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Only markers with above 70% sensitivity are shown. 
Conclusion 

We have compared the protein population of serum from colon cancer patients with serum 
5 from healthy individuals by different methods (different chips and different binding 

conditions). By the described procedure, we have identified a number of proteins that are 
differentially expressed (either up- or down-regulated) in serum from colon cancer patients 
compared to serum from normal individuals. 

10 We find that the IMAC30 screening gives the prominent results, and the markers obtained 
from these screenings have been shown to have predictive power in discriminating 
between samples from healthy individuals and patients diagnosed with colorectal cancer. 

The difference of markers detected in serum of this study as compared to the study 
15 described in example 1 is based on the state of the samples. The samples of this study 
were freshly frozen and thawed once prior to analysis, whereas the samples from example 
1 have been thawed and refrozen several times. 

The study further shows that some markers are detected on more than one type of chip, 
20 such as the up-regulation of 5900 as well as the down-regulation of 4290 on both CMlo' 
and IMAC. Moreover, the study shows that by using more than one type of chip, the 
number of markers detected by using this technology can be increased considerably. 

EXAMPLE 6 IDENTIFICATION OF BIOMARKERS FOR COLON CANCER BY DATA MINING OF 
25 MASS SPECTRA 



The aim of this study was to separate healthy individuals from colorectal cancer patients 
using a Principal Component Analysis (PCA) on a normalised data set from mass spectra. 

30 Methods 
Samples 

Serum samples were obtained from 12 healthy individuals and 35 patients diagnosed with 
colon cancer and the samples were assayed on IMAC30 chips according to the protocol 
described above in example 5. 

35 

Data mining 

Raw data sets from mass spectra were normalised based on total ion current. 
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Data sets containing m/z, intensity and area of the peaks identified by "biomarker wizard 
were generated as follows: 

* 

Data set 1: 4 healthy individuals and 4 patients diagnosed with colon cancer. 
Data set 2: 8 healthy individuals and 8 patients diagnosed with colon cancer. 
Data set 3: 12 healthy individuals and 35 patients diagnosed with colon cancer. 



Computer programs: 

Ciphergen ProteinChip Software with "biomarker wizard". 
Multi Variate Statistical Program (MVSP), Kovack Computing. 



Parameters 

Biomarker wizard settings: 
15 First pass: 5 

Min peak threshold: 0 
Cluster mass window: 0.3 
Second pass: 2 



20 Principal Component Analysis settings (MVSP): 

Data standardised: Yes 
Data centred: Yes 



Results: 

25 Principal component analysis of data set 1 resulted in two distinct groups, and identified as 
healthy individuals and patients with colon cancer. The separation was on the first principal 
component and all peaks irrelevant for the separation was removed from the analysis. 
Potential markers: 2960, 3170, 3980, 4650, 5340, 5906, 6120, 6840, 6880, 8940, 9140, 
and 28010 were identified. 

30 Principal component analysis of data set 2 resulted in two distinct groups, and identified as 
healthy individuals and patients with colon cancer. 

Potential markers: 1530, 3980, 4650, 5340, 5545, 5906, 6090, 6120, 6880, 11799, 
13745, and 28010 were identified. 

35 The most prominent combination of markers in both data set 1 and 2 were the following 
markers: 3980, 5340, 5906, 6880, and 28010 with 100% sensitivity and 100% specificity. 
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Data set 3 was used to verify the power of the selected markers. 

Table 11: Sample scores of data set 2 using the following markers: 3980, 5340, 5906 
6880, and 28010. 



Sample ID 


Sample scores 
on PCI 


Group 


Chipl7-A 


-0.336 


cancer 


Chipl7-B 


0.64 


normal 


Chipl7-C 


-0.592 


cancer 


Chipl7-D 


0.639 


normal 


Chipl7-E 


-0.248 


cancer 


Chipl7-F 


0.46 


normal 


Chipl7-G 


-0.154 


cancer 


Cr)jpi/-H 


0.148 


normal 


Chlpl7-A(2) 


-0.317 ~ 


cancer 


Chipl7-B(2) 


0.114 


normal 


Chipl7-C(2) 


-0.442 


cancer 


Chipl7-D(2) 


0.591 


normal 


Chipl7-E(2) 


-0.934 


cancer 


Chipl7-F(2) 


0.616 


normal 


Chipl7-G(2) f 


-0.541 


cancer 


Chfpl7-H(2) 


0.356 


normal 



Figure 15 shows a scatter-plot of the sample scores and variable loading of data set 2. The 
figure demonstrates the power of the PCA. 



10 



Table 12 The sensitivity and specificity of data set 3 





Data set 3 


Sensitivity 


84 % 


Specificity 


83 % 



The theoretical example shown here below demonstrates the power of the prediction 
model. 



The intensity and m/z of the 5 markers (3980, 5340, 5906, 6880, and 28010) were then 
15 used on a data set comprising 2 healthy individuals, 2 patients diagnosed with colon 
cancer, and 4 unknown by applying PCA. 
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Table 13 Sample scores from PCA of samples form healthy individuals, cancer patients and 
samples from unknown subjects. 



Sample 
ID 


Group 


Sample 
score on PCI 


Chip25A 


Cancer 


-0.669 


Chip25B 


Healthy 


0.995 


Chip25C 


Cancer 


-0.686 


Chip25D 


Healthy 


0.520 


Chip25E 


Unknown 


0.403 


Chip25F 


Unknown 


-0.425 


Chip25G 


Unknown 


-0.805 


Chip25H 


Unknown 


0.666 



5 Table 14 Numeric distance of sample scores from table 13. 




10 



Conclusion: 

Principal Component Analysis can separate healthy individuals from patients with colon 
cancer using the intensity of the selected markers. 

EXAMPLE 7 A METHOD FOR DISCRIMINATING BETWEEN HEALTHY INDIVIDUALS AND 
PATIENTS WITH COLON CANCER 

The aim of the study was to develop a method for discriminating between healthy 
individuals and patients with colon cancer based on data from mass spectra generated 
15 using protein chips and the SELDI TOF mass spectrometry technique. 
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Data mining 
Data sets: 

Data set A: Intensities of the five serum markers from 24 patients diagnosed with colon 
cancer and 47 healthy individuals. 
5 Data set B: Data set A minus the average of the intensity in healthy individuals. 
The intensities were normalised based on total ion current. 

Data format 

The input data from each sample contained: Sample ID, intensity of 3980, 5340, 5906, 
6880, 28010, and a grouping variable (1 = cancer, 0 = healthy). 

10 

Cut-off values 

Three different cut-off values were analysed (0.4, 0.5, and 0.6). 
Predicted grouping 

15 If the predicted result is above cut-off, the sample is classified as positive for colon cancer 

If the predicted result is below cut-off, the sample is classified as negative for colon cancer 
(0). 

20 Weights 

The number of weights Is 5 (one for each marker). 
The weight is a number between -0.9 and 0.9. 

< 

Calculation 

The program reads the data-file line by line, and stores them. For each combination of 
25 weights and each sample the predicted grouping is calculated: 

Predicted grouping = a * int(3980) + b * int(5340) + c * int(5906)* 0.1 + d * int(6880) + 
e * mt(28010) 

weights = a,b,c,d,e int(3980) = intensity of marker 3980 Da, etc. 

30 

Specificity and sensitivity is calculated, based on the predicted result, cut-off value, and 
grouping variable. 

In order to identify the parameters for predicting cancer from a biological sample using 
35 selected markers, the following algorithm was used: 
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The input-file consists of intensities of the five markers and the desired result (if cancer 
1, if healthy = 0) 

Place all lines from input-file in a list 

5 

A weight can take one of the following: -0.9, -0.8, -0.7, -0.6, -0.5, -0.4, -0.3, 
-0.2, -0.1, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9. 

Make all possible weight combinations: 

10 

for each possible factor that the first weight can take and 
for each possible factor that the second weight can take and 
for each possible factor that the third weight can take and 
for each possible factor that the fourth weight can take and 
15 for each possible factor that the fifth weight can take. 

for each cut-off (0.4, 0.5, 0.6) 

for each possible combination of weights and each input line: 
20 one = intensity of 3980 * first weight 

two = intensity of 5340 * second weight 

three = intensity of 5906 * third weight * 0.1 

four = intensity of 6880 * fourth weight 

five = intensity of 28010 * fifth weight 
25 calculated result - one + two + three + four + five 

if desired result = 1 and calculated result >= cutoff, true positive + 1 
if desired result = 0 and calculated result < cutoff, true negative + 1 
if desired result = 1 and calculated result < cutoff, false negative + 1 
30 if desired result = 0 and calculated result >= cutoff, false positive + 1 



calculate sensitivity and specificity for this combination of weights 

sensitivity = ((true positive / (true positive + false negative)) * 100 
specificity = ((true negative / (true negative + false positive)) * 100 

change specificity and sensitivity into integers 

if sensitivity > 70 and specificity > 70 
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join sensitivity, specificity, cutoff, and weights in one line 
place the line in an array 

When all combinations of cut-off, weights, and input have been explored sort the array. 

5 



Results 

Table 15: Different weight combinations and the corresponding specificity and sensitivity. 



No. 


Data 


Cut-off 


Weights 


Specificity 


Sensitivi 




set 


value 


a 


b 


c 


d 


e 


% 


% 


1 


A 


0.4 


-0.9 


0.3 


0.1 


0.3 


-0.6 


87 


95 


2 


A 


0.4 


-0.9 


0.4 


0.2 


-0.1 


-0.9 


80 


95 


3 


A 


0.5 


-0.9 


0.4 


0.2 


-0.6 


0.2 


85 


91 


4 


B 


0.5 


-0.4 


0.3 


-0.5 


-0.6 


-0.1 


91 


95 


5 


B 


*0.5 


-0.9 


0.2 


0.1 


-0.2 


-0.2 


85 


95 


6 


B 


0,6 


-0,4 


0,3 


-0,5 


-0,7 


0,1 


91 


95 



10 The algorithm used for prediction is as follows: 

Get intensities of the 5 markers for the sample from known healthy individuals in 
the following order: 3980, 5340, 5906, 6880, 28010 Da. 

15 Calculate average intensity of the 5 markers. 

Get intensities of the 5 markers of the test sample. 
Subtract the average intensity calculated above. 

20 Multiply the first intensity with weight a 

Multiply the second intensity with weight b 
Multiply the third intensity with weight c 
Multiply the fourth intensity with weight d 
Multiply the fifth intensity with weight e 

25 

Calculate the sum of the above multiplications. 

If sum < cutoff value => sample is negative for colon cancer. 
If sum > cutoff value => sample is positive for colon cancer. 

30 
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Conclusions 

The program found equations, which had sensitivity and specificity above 90 %. The 
intensity of the marker 5906 is approximately 10 times higher than the other markers. 
Therefore, in order to prevent the 5906 marker to carry more weight than the other 
markers it is multiplied by 0.1. The best performing equations were number 1, 4, and 6. 
This shows that computer algorithms are able to discriminate between healthy individuals 
and patients with colon cancer. With a larger number of samples it would be possible to 
use artificial neural network or other computer algorithms to be trained on the data. This 
might result in increased sensitivity and specificity of the markers. 

EXAMPLE 8 IDENTIFICATION OF BIOMARKERS FOR COLORECTAL CANCER IN TISSUE 
SAMPLES 

Samples 

Tissue samples were obtained from cancer patients after surgery. Tissue samples were 
15 obtained from the removed fragment of the patients colon following surgical treatment for 
colon cancer and were stored at -80 °C until use. 

Sample preparation 

100 mg tissue sample was thawed on ice and homogenised on a Wheaton Overhead Stirrer 
for 2 minutes at speed step 2, in 500 pi Lysis buffer (lOOmM TRIS-HCI, pH 8.0, 9.5 M 
20 UREA, 2% CHAPS). The samples were centrifuged at 14,000 rpm for 10 minutes and the 
pellet was discarded (repeated twice). The tissue protein extracts were stored at »80°C 
until use. Samples were compared by the SELDI-TOF/MS technique (Ciphergen). 

Samples were pre-treated by applying 5pl of pre-treatment solution to the chip surface 
25 and the chip was left on shaker for 5 minutes. This process was repeated twice. The 

solution was removed by washing the chip twice in MQ-water and once in binding buffer. 

Tissue samples were thawed on ice and 10pl tissue sample was diluted in 50 pi binding 
buffer and left on shaker for 40 minutes. Next the samples were removed and the chips 
30 were washed twice in washing buffer, followed by wash in MQ-water. The chips were left to 
dry at room temp for 20 minutes and 0.6 pi of crystallisation solution was applied twice. 
Analysis 

The PBS II instrument (Ciphergen) was calibrated prior to use and chips were analysed 
with detector sensitivity and laser intensity at suitable values. 

35 Data mining 

All spectra were pooled into one experiment file and were normalised based on total ion 
current. Markers were identified by the Biomarker Wizard software (Ciphergen). 
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Description of chips used in tissue screening 

As described, the protein chip surfaces are composed of common chromatographic resins 
commonly used in other purification techniques: 

5 

SAX2 ProteinChip Array 

The SAX2 ProteinChip Array is a strong anion exchange array with quaternary amine 
functionality. 

NP20 ProteinChip Array 

10 NP20 ProteinChip Arrays, mimic normal phase chromatography with silicate functionality. 



Description of buffers used for binding and washing steps in the tissue screening 
The buffer solutions used, are common buffers used in other purification techniques 



SAX2 screening 
15 Pre-treatment 
Binding step 
Washing step 

NP20 screening 
Pre-treatment 
20 Binding step 
Washing step 



100 mM TRIS-HCI, pH 8.0 
100 mM TRIS-HCI, pH 8.0 
100 mM TRIS-HCI, pH 8.0 



50 mM TRIS-HCI, pH 8.0 
50 mM TRIS-HCI, pH 8.0 
50 mM TRIS-HCI, pH 8.0 



Table 16 Possible tumour markers with the following mass values for each protein chip 
type 



Chip 


Up-regulated 


Dow n- reg u la ted 


SAX2 


11987 Da, 5871 Da, 5234 Da, 
4281 Da, 4266 Da, 4039 Da, 
4024 Da, 3408 Da, 2933 Da, 
2878 Da, 2840 Da, 2799 Da, 
2693 Da, 2462 Da, 2364 Da 


15140 Da, 11989 Da, 9600 Da, 9197 Da, 
9079 Da, 8971 Da, 7324 Da, 5075 Da, 
4749 Da, 4634 Da, 3984 Da, 3777 Da, 
2330 Da, 1930 Da 


NP20 


5857 Da, 4264 Da, 3878 Da, 
3712 Da, 3651 Da, 3574 Da, 
3487 Da, 3444 Da, 3372 Da, 
1688 Da 


9700 Da, 8652 Da, 8652 Da, 8580 Da, 
7023 Da, 5360 Da, 4168 Da, 1365 Da, 
1256 Da, 1042 Da, 1026 Da, 1005 Da 



25 
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Table 16 shows a number of putative markers for colon cancer using more than one type 
of chip. Although some markers may be detected using different chip with various surface 
characteristics, most of the markers detected by the different chip types do not overlap. 
This allows for detection of a larger number of markers in the same sample. 



10 



15 



20 



25 



30 



35 



EXAMPLE 9 POSSIBLE TUMOUR MARKERS WITH MOLECULAR WEIGHT CORRESPONDING 
TO IDENTIRED BIOMARKERS 

The aim of this study was to use bioinformatics to associate the identified markers with 
annotated genes with a known function. 

Many of the possible tumour markers have masses that correspond to specific peptides in 
the database. The mass values of the individual tumour markers may in some cases 
correspond to the mass values of specific human proteins in the database. By searching 
with the mass value of each tumour marker, a number of possible hits occur. These hits 
are possible identifications of the proteins. 

Data bases and search engines 

Database : Swiss-Prot (Human) 

Search tool : Tagldent (Expasy) 

Allowed deviation: Up to 0.5% deviation from noted mass accepted 

Table 17 Possible identification of peptides up-regulated in tumour tissue on SAX 2 Chip. 



2364: 


Fragment of human serum albumin/ alpha-fetoprotein ' 

(seq : FLGM FLYEY ARRH PDYS VV) (SEQ ID NO 1) 


2462: 


APMl HUMAN (P35318) POLG HRV14 (Pn^ 3 03) REL3 HUMAfM (Oftwyp^) 


2693: 


MOT! HUMAN (Pl?ft7?) " 


2799: 


HEPC HUMAN (PR1177) 


2839: 


No hits " " " 


2878: 


No hits " " — 


1 2933: 


TERA HUMAN (P^n79) " 


3112: 


No hits " ~" " — — 


3408: 


CALO HUlvjAN (Pnn^) "~" A 


4024: 


COPA HUMAN. (P53621) NEU2 HUMA N rpnn^) "H 


4039: 


COPA HUMAN (P53621) D£F6 HUMAN (Q01 524) NEU2 HUMAW (Pnn^j 

PYY HUMAN (Pmnoo) 


4266: 


NEUY HUMAN. (P01303) RFRP HUiv^N (™H™7) 
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4281: 


NEUY HUMAN (POI^m) " ' ' j 


5234: 


CART HUMAN (Q16568) CATB HUMAM (Prv7 8 58) STAT HUMAN (Pn?«n ?) 


5871: 


GAG HV1A2 (P03349) GAG HV1C4 (Pf^ 8 87) GAG HV1J3 (P124Q4) 


Table 18 Possible identification of peptides down-regulated in tumour tissue on SAX2 chip 


Biomarker Entry Name (primary accession number) 


1930: 


No hits 


2330: 


POLG_HRV16 (Q82122) POLG_HRV89 (P07210) " ~ 


3777: 


APP1_HUMAN (P51693) CAL1_HUMAN (P06881) GLUC_HUMAN (P01275) 
SARL_HUMAN (000631) 


3984: 


TKN1_HUMAN (P20366) 


4634: 


CCKN_HUMAN (P06307) 


4749: 


CRF_HUMAN (P06850) 


5075: 


CMGA_HUMAN (P10645) CMGA_HUMAN (P10645) TYBN_HUMAN (Q99406) 


7324: 


POLG_HRVlB (P12916) YU01_HUMAN (Q9BTD5) (+5 other hits) 


8971: 


>10 hits ~ " ' — 


9079: 


>10 hits " - — 


9197: 


>10 hits : 


9600: 


>10 hits ~ — 


15140: 


>10 hits 


Table 19 Possible identification of peptides up-regulated in tumour tissue on NP?n rhi P 


Biomarker 


Entry Name (primary accession number) 


1688: 


NEUT HUMAN (P30990) S112 HUMAN (Pftn^n) 


3372: 


CU89 HUMAN (P59042) DEFJ. HUMAN (PS9665) DEF3 HUMAN (PRQfififi) 
GLUC HUMAM (Pm?7<;) 


3444: 


I>EF1 HUMAN (P59665) (+6 other hits) 


3487: 


DEF3 HUMAN (P59666) GLUC HUMAN (P01275) NEUY HUMAM (Pm 
PTHR HUMAN (P1 9777) 


3574: 


OREX HUMAN. (043612) PISD HUMAN (Q9UG56) PNOC HUMAN (Q1^1Q) 
SEMI HUMAN (Pn^7o) 


3651: 


CMffiA HUMAN (P1 0645^ PRRP HUMAM (Pftn?7) 


3712: 


A4 HUMAN (P05067) DEF4 HUMAN (P17838) PSPC HUMA.N (P1 ifi«ft) 


3878: 


CCKN HUMAN (P06307) EXXK ADFO? (Pm 2 42) GAST HUMAN (pnnRn) 


4264: 


NEUY HUMAN (P01303 - ) RFRP HUMAN (QQHrrv?) — 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/090550 



PCT/DK2004/000263 



48 



10 



15 



5857 


: GAG HV1C4 (P05887) GAG HV1J3 (P1?494) PRP2 HUMAN (P0?R1 9) 


Table 20 Possible Identification of peptides down-regulated in tumour tissue on NP20 chip 




»di Ker entry name (primary accession number) 


1005 

J- V \J «J 


• f^Alll Ml IMAM /r>nioiro'\ 


1026 

J- \J C~ \J 


• IVU III L i> 


1042 




1256 


: GON2 HUMAN (f1d^«) 


1365: 


NPJF£..HUMAN (0151.30) PIV6 ADE1? (p^qrr) 


4168; 


PAHO HUMAN (P01 298) POLN LORDV (P^afi^i) 


5360: 


COXO HUMAN (P1 t;Q^4) 


7023: 


RPCX HUMAN (pc^ftm) 


8580: 


>10 hits 


8652: 


SZ10 HUMAN (P07778) (+5 other hits) ~ 


9700: 


>10 hits 



Results and conclusion 

It should be noted that the hits may not necessarily refer to the full length protein encoded 
by the specified gene, but in many cases to a specific peptide produced by alternative 
20 splicing or post-translational processing, hence one mass value may produce more than 
one hit within one gene. 

The results show that some of the markers identified in the examples listed above can be 
linked to proteins, which have been associated with tumour initiation, tumour growth or 
25 tumour progression, such as Def 1 and 3 as well as Cathepsin B. 



Furthermore, it should be noted that some of the markers detected by the mass 
spectrometry might reflect degradation products of larger proteins. 



30 EXAMPLE 10 USE OF DEFINSINS AS SERUM MARKERS FOR CANCER DETECTED BY MASS 
SPECTROMETRY 

SELDI-TOF/MS (Surface Enhanced Laser Desorption/lonisation-Time Of Flight/Mass 
Spectrometry) protein profiling was used to demonstrate that the expression of human 
neutrophil peptides -1, -2 and -3 (HNP 1-3), also known as alfa-defensin-l, -2 and -3, is 
35 up-regulated in colon tumour tissue relative to normal colon tissue. Further, by comparing 
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serum from colon cancer patients with serum from a group of healthy Individuals, we show 
that this abnormal HNP 1-3 expression is reflected in colon cancer serum. 

The tissue screening was performed on NP20 chip, whereas the serum screening was 
5 performed on SAX2 chip. 

NP20 ProteinChip Array 

NP20 ProteinChip Arrays, mimic normal phase chromatography with silicate functionality. 
Pre-treatment : 50 mM TRIS-HCI, pH 8.0 

10 Binding step : 50 mM TRIS-HCI, pH 8.0 

Washing step : 50 mM TRIS-HCI, pH 8.0 

SAX2 ProteinChip Array 

15 The SAX 2 ProteinChip Array is a strong anion exchange array with quaternary amine 
functionality. 

Pre-treatment : 100 mM TRIS-HCI, pH 8.0 

Binding step : 100 mM TRIS-HCI, pH 8.0 

Washing step : 100 mM TRIS-HCI, pH 8.0 

20 

The Defensin screening was performed by as described for the general serum / tissue 
screenings. The expression of three peptides with mass/charge ratio (m/z) values of 3372, 
3443 and 3486 (+/- 0.1%) were found to be up-regulated in the tumour samples 
compared to the samples and up-regulated in serum from patients with colon cancer when 
25 compared with serum from healthy individual. The three peptides were subsequently 
identified as HNP 2, 1 and 3, respectively. This was done by peptide mapping (trypsin 
digest) and reduction with DTT„ 

EXAMPLE 11 EXPRESSION OF ALFA-DEFENSIN-1, -2 AND -3 IN SERUM AND TISSUE OF 
30 COLON CANCER PATIENTS 

The aim of this study was to define the relationship of the expression of human neutrophil 
peptides -1, -2 and -3 (HNP 1 -3) and colon cancer. 

Materials and methods 
35 Tissue screenino 

Tissue samples were obtained from the removed fragment of the patient's colon following 
surgical treatment for colon cancer and were stored at -80°C until use. 100 mg tissue 
sample was thawed on ice and homogenised on a Wheaton Overhead Stirrer for 2 minutes 
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at speed step 2, in 500 pi Lysis buffer (lOOmM TRIS-HCI, pH 8.0, 9.5 M UREA, 2% 
CHAPS). The samples were centrifuged at 14,000 rpm for 10 minutes and the'pellet was 
discarded (repeated twice). The tissue protein extracts were stored at 80°C until use. 
Minor pilot studies were performed on different chips (data not shown) and the NP20 
5 (Normal Phase) (Ciphergen) chip was chosen for the tissue screening. NP20 chips was 
placed in bioprocessor and pre-treated with 50 pi tissue binding buffer (50 mM TRIS-HCI, 
P H 8.0) for 5 minutes on shaker (250 rpm) (repeated twice). 5 pi tissue protein extract ' 
was diluted In 50 pi tissue binding buffer and incubated in bioprocessor on IMP20 chips for 
40 minutes at room temperature on shaker (250 rpm). Spots were washed twice in 250 pi 
10 tissue washing buffer (50 mM TRIS-HCI, pH 8.0) for 5 minutes. The chips were air-dried 
for 20 minutes, followed by treatment with two times 0.6 pi 100% SPA matrix solution. 

Serum srrppnlnn 

Cancer serum samples were obtained from cancer patients prior to surgery. Normal serum 
15 was obtained from a group of healthy individuals matched by age and gender to the cancer 
patients. Serum samples were stored at -80°C until use. Serum pilot studies were 
performed on different chips to monitor the presence of HNP 1-3 in serum (data not 
shown). The immobilised metal affinity capture (iMAC30) chip was chosen for the actual 
screening and pre-treated with nickel before analysis: 5 pi 100 mM NiS04 were added to 

20 each spot and left on shaker (150 rpm) for 5 minutes (repeated twice). The chips were 
placed in bioprocessor and incubated with lOOpI MQ for 5 minutes on shaker (250 rpm). 
Each spot was treated with 50 pi serum binding buffer (100 mM TRIS-HCI, pH 7.5, 500 mM 
NaCI, 0,lo/o Triton X-100) and left on shaker for 5 minutes (250 rpm). Serum samples 
were thawed on ice and 5 pi serum was diluted in 50 pi serum binding buffer and applied to 

25 spots and left on shaker (250 rpm) at room temperature for 40 minutes. Samples were 
removed and spots washed twice in 200 pi serum washing buffer (100 mM PBS, pH 7.4, 
700 mM NaCI), followed by one wash in 200 pi MQ-water. The chips were removed from 
the bioprocessor and left to air dry for 20 minutes followed by treatment with two times 
0.6 pi SPA (100%). Only freshly made matrix solutions were used and the instrument was 

30 calibrated prior to use. Cancer and normal samples were run side by side. The chips were 
analysed on a PBSII instrument (Ciphergen). All spectra in each screening were normalised 
based on total ion current. 



Purification and identification of HNP 1-^ 
35 100 pi protein extract from cancer tissue in tissue lysis buffer was loaded unto a RP-HPLC 
column (uRPC C2/C18 ST 4.6/100, Pharmacia Biotech, Flow rate: 0.5 ml/mln, Fraction 
size: 0.5 ml) in buffer A (0.065% Tri-flouro-aceticacid (TFA) in MQ-water) and proteins 
were eluted in a gradient of 0-100% buffer B (0.05% TFA in acetonitrile (ACN)). Elution of 
peptides was monitored by absorption spectrometry (OD280). All protein containing 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/090550 PCT/DK2004/000263 

51 

fractions were analysed by MALDI-TOF (Matrix Assisted Laser Desorption/Ionization-Time 
of flight) on the PBS II instrument: 1.5 nl fraction was incubated with 0.6 M l SPA (100%) 
on a Gold array (Ciphergen) and left to crystallise on chip, followed by an additional 0.6 ix\ 
SPA (100%) and the Gold array was analysed by MALDI-TOF. The HNP 1-3 containing 
5 fraction (32% buffer B) was further purified on a peptide gel-filtration column (Superdex 
Peptide HR 10/30, Pharmacia Biotech, Flow rate 0.9 ml/min, Fraction size: 0.5 ml, Buffer: 
50% ACN, 0.1 % TFA). Elution of peptides was monitored by absorption spectrometry 
(OD280) and protein containing fractions were again analysed by MALDI-TOF on the PBS II 
instrument as described. Purified HNP 1-3 was subjected to on-chip trypsin digestion. 10 jd 

10 HNP 1-3 fraction was applied to NP20-chip and left on shaker (250 rpm) at room 

temperature for 40 minutes. Sample was removed and spot was washed twice with 10 M l 
water (on-chip purification step). In order to denature peptides prior to digestion, the chip 
was left on heating block (80 C) for 5 minutes. The chip was cooled on Ice for 2 minutes. 
10 nl trypsin digestion solution (O.Olug/pl trypsin in 50 mM NH4HCOS, pH 8.0) was added 

15 and the chip was left for 10 hours at 40°C in humidity chamber after which the chip was 
left to air dry. for 20 minutes. 1 n» CHCA (100%) was added and the peptide map was 
analysed on PBS II instrument. Identification was done by the use of Pepldent on the 
Expasy server. 

20 Size exclusion chromatograp h y of HNP 

50 urn colon cancer serum was loaded unto a peptide gel-filtration column (optimal 
separation range: 1 to 7 kDa, flow rate: 0.5 ml/min, fraction size: 0.5 ml, buffer: 10 mM 
Ammonium carbonate, pH: 8.0). Elution of peptides was followed by absorption 
spectrometry (OD280). All protein-containing fractions were analysed by MALDI-TOF on 

25 PBSII (Ciphergen) as described above. Maximum signal intensity of 40 individual peaks 
was plotted as a function of elution volume and an approximate elution curve was 
calculated. 

Functional gi-nHy of HNP 1-3 hv mlrrnflr.^ 

30 For micro flow experiments, MDCK cells were plated onto poly-d-lysine coated cover slips 
at a concentration 3000cells/well, grown in DM EM with 10% FBS for five days with the 
result of confluent islands. Microflow was performed in an Eppendorf micromanipulator 
5171 and transjector 5246 system mounted on a Leica DMIRBE inverted research 
microscope. Micro capillaries (borosilicate with filament, Sutter Instruments Company, 

35 Novate, California, USA) were pulled to an outer diameter of 0.85 nm on a Sutter P-97 
Micropfpette Puller. The dye-loaded cells were visualised by excitation at 470 nm and 
recorded at 509-nm emission using Haupage version 3.3.18038 software and Kappa CF 
15/4 MC-S camera (Leica). The MDCK cells were recorded (in C02 independent media) on 
the inverted DMIRBE inverted research microscope. The capillary was placed 20 nm over 
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the confluent cells with a constant flow (1300 hPa) of calcein (20mM). The MDCK cells 
were exposed to peptide fractions purified from colon tumours by size-exclusion 
chromatography. 

Results 

5 HNP1-3 expression in tissue and 5w»ri |m , 

Pilot studies of colon tumour and normal colon tissue was performed on a variety of chips 
with different chemical properties and under different binding and washing conditions. 
Based on these preliminary studies, the expression of three peptides with mass/charge 
ratio (m/z) values of 3372, 3443 and 3486 (+/- 0.1%) (subsequently identified as HNP 2, 

10 1 and 3, respectively), were found to be up-regulated in the tumour samples. The three 
peptides were visible on different chips and under different binding conditions (data not 
shown). However the strongest signals of HNP 1-3 in tissue extract were obtained on the 
NP20 (Normal Phase) chip, whereas the strongest signal of HNP 1-3 in serum was 
observed on the iMAC30 (immobilised metal affinity capture) chip activated with nickel, 

15 and these conditions were chosen for the actual screenings. Protein extract from 40 colon 
tumour and 40 normal colon tissue samples were analysed on NP20 chips and 125 colon 
cancer serum samples and 100 normal serum samples were analysed on iMAC30 chips. All 
spectra in each screening were pooled and normalised based on overall ion current. Each 
spectrum produced approximately 40 to 60 protein peaks in the range from 2 to 80 kDa 

20 (FIG. 16A-C). Statistical analysis of the intensity values of HNP 1-3 in the tissue screening 
(FIG. 17A showed) that HNP 1-3 were significantly up-regulated in tumours (p<0.0005). 
Similarly, statistical analysis of HNP 1-3 expression in the serum screening (FIG. 17B.) 
showed that HNP 1-3 were significantly up-regulated in cancer serum also (p<2.2e-16). 
Compared to other peptides in the same range, HNP 1-3 showed average signal intensity 

25 in most normal colon tissue extract, whereas the HNP 1-3 signal was extremely high in 

most tumour samples (in some tumour samples the HNP 1-3 was the most prominent of all 
detected peptides). On the contrary, in the normal serum samples the HNP 1-3 signals 
were relatively low, and only slightly, but still significantly, higher in the cancer serum. 
This difference between the HNP 1-3 signal In the tissue screening performed on the NP20 

30 chip and serum screening performed on the iMAC30 chip was not due to the different chips 
used in the screenings, since the HNP 1-3 signal in serum was relatively low on the NP20 
chip also (data not shown). Thus, even though most tissue samples were "contaminated" 
with blood, the vast majority of the HNP 1-3 signal originated from the tumour 
microenvironment. This was verified by gel-filtration analysis of tissue extract versus 
35 serum. HNP containing fractions from tissue analysis were far more concentrated 

(approximately xlO) than the same fractions in serum analysis, as seen by MALDI-TOF 
analysis (data not shown). 
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Identification of HIMP 1-3 

The three possible markers were purified by RP-HPLC, peptide gei-filtration and on-chip 
purification, after which they were identified by peptide mapping as HNP-2 (3372 Da), 
HNP-1 (3442 Da) and HNP-3 (3486 Da) (Table 1A.). The measured masses correspond to 
the peptides in their oxidised states, with three disulphide bridges. After heat denaturation 
(10 minutes, 80°C) and treatment with DTT (200mM DTT, room temperature, 30 
minutes), HNP-l and HNP-2 increased 6 Dalton in mass, due to reduction of the six 
cysteines (Table IB). We were not able to reduce HNP-3, due to degradation during the 
reduction process. 



Size exclusion chromatography of HNP 1-3 

50 nl colon tumour extract in Lysis buffer was applied to a peptide gel-filtration column. 
Elution of peptides was followed by absorption spectrometry (OD280). All fractions were 
analysed by MALDI-TOF on PBSII (Ciphergen). Maximum signal intensity of 40 individual 
15 peaks was plotted as a function of elution volume and an approximate elution curve was 
calculated (FIG. 18). HNP 1-3 peptides were found to be eluted primarily together with 
high mass proteins above 20 kDa and to a lesser degree in fractions together with other 
peptides of the same mass interval (FIG. 18). 

20 Cvtoxic assay 

The cytotoxicity of HNP 1-3 purified from colon tumours was tested by exposing MDCK 
cells to different fractions purified from colon tumours. Calcein were added to the fractions 
and the solutions were left to overflow the cells for one hour. By fluorescence microscopy 
calcein was observed to accumulate only in cells exposed to HNP 1-3/calcein fractions, 
25 whereas cells treated with fractions containing other (unidentified) tumour peptides did not 
uptake calcein (FIG. 19 C&D). Further, by normal microscopy, we observed that cells 
exposed to HNP 1-3 appeared more diffuse and had enlarged nuclei, indicating apoptosis 
(FIG. 19 A&B). 



30 Discussion 

Elevated concentrations of HNP 1-3 in colon cancer serum 

Abnormal concentration of HNP 1-3 in body fluids has previously been demonstrated. 
Elevated concentrations of HNP 1-3 following infection (bacterial-/non-bacterial- infection 
and pulmonary tuberculosis) has been found in plasma, blood and a number of body fluids 
35 and plasma HNP 1-3 concentrations have been shown to be elevated in patients with 
septicaemia or bacterial meningitis. HNP 1-3 have been found in urine from patients with 
transitional cell carcinoma of the bladder and in salvia of patients with oral carcinomas. 
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Our study is the first that demonstrate elevated concentrations of HNP 1-3 in serum 
following tumour growth. 

Elevated concentrations of hm p 1-3 in colon tumour* 
5 HNP expression has previously been linked to different types of tumours and cell lines. 
HNP-1 has been detected in lung tumours and in the submandibular glands of patients with 
oral carcinomas. By RT-PCR, mass spectrometry and flow cytometric analysis, HNP 1-3 
have been shown to be expressed by cell lines deriving from renal cell carcinomas and the 
expression of a specific HNP precursor peptide has been shown to be up-regulated in 

10 human leukaemia cells. In a study of squamous cell carcinomas of the human tongue it 
was suggested that the tumour expressed HNP 1-3 originated from tumour invading 
neutrophils. Since our tissue screening is based on comparison of whole tissue samples, 
the up-regulated expression of HNP 1-3 may not necessarily originate from the colon 
cancer cells, but could originate from tumour infiltrating neutrophils. HNP 1-3 are known to 

15 stimulate bronchial epithelial cells to up-regulate interleukin-8 production, a potent 
neutrophil chemotactic factor and HNP 1-3 are also capable of regulating the systemic 
immune response (discussed below). Thus, the up-regulated expression of HNP 1-3 in 
colon tumours may primarily originate from invading neutrophils, but could be initiated by 
HNP 1-3 produced by cancer cells. Even though the signal intensity in mass-spectrometry 

20 can not directly be interpreted as a measure of protein concentration, our results suggests 
that HNP 1-3 are very abundant in colon tumours. This is in agreement with the study of 
HNP-1 in lung tumours, where the maximum observed level was 26 nano-moles per gram 
wet tissue. It follows, that in order for these excessive amounts of peptide to be detectable 
in serum, the peptides must be released from the cells. This is in agreement with studies 

25 of HNP 1-3 expression in kidney and brain. 

Size exclusion chromatography of HNP 1-3 

We explain the elevated concentrations of HNP 1-3 in colon cancer serum by unspecific 
binding between HNP 1-3 and high mass serum proteins. We believe the peptides attach to 

30 serum proteins in the tumour area and are carried into the bloodstream. Even though the 
HNP 1-3 we observe in high mass fractions from size exclusion, could also be explained by 
multimerisation, we interpret the size exclusion results as evidence for interaction between 
HNP 1-3 and unidentified high mass proteins through unspecific interactions. In one study, 
it was demonstrated that Defenslns form voltage dependent channels in lipid bi-layer 

35 membranes, supported by further conductance investigations, suggested that the channels 
were formed by multimers containing 2-4 molecules and a crystal structure study of HNP-3 
revealed an amphiphilic dimer. We add to the growing realisation that common plasma 
proteins bind disease specific peptides and therefore should not be ignored in marker 
research. Our size-exclusion results are in agreement with a number of previous studies 
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that show that HNP's are bound to plasma protein in vitro and that high concentrations of 
HNP's causes precipitation of plasma proteins, specifically 2-macroglubulin and CI 
complement has been shown to bind Defensin. Another study showed that HNP-l bind to 
various serum proteins, notably serum albumin, and it was found that serum, or serum 
5 albumin, was able to inhibit the anti-viral activity of HNP-l. This ability to bind to serum 
proteins could also explain why HNP 1-3 lysis of mammalian cells is hindered in the 
presence of serum. 

Common to beta-Defensin 2, another member of the Defensin family, and HNP 1-3 is an 
10 uneven distribution of surface charges. Beta-Defensin 2 has been shown to bind to a 

chemokine receptor and it has been suggested that the positively charged cluster, which is 
also shared by chemokines, may play a common role in binding to receptors in general, 
but is not important for determining receptor specificity. The same surface charge could 
also explain the binding of HNP 1-3 to plasma proteins. The observation that Defensins are 
15 localised to lymphocyte nuclei could similarly be explained by unspecific binding to shuttle 
proteins. 

HNP 1-3 - cytotoxic peptides 

The exact concentration of HNP's in the tumour microenvironment may have profound 
influence on the in vivo function of HNP 1-3. One study shows that HNP 1-3 mediates lysis 
of tumours in a concentration dependent manner. This is in agreement with another study 
that show that only relatively high concentrations of HNP-l (10-4 M) are cytotoxic for 
human monocytes, whereas lower concentration of HNP-l (10-8 to 10-9 M) increases TNF- 
alpha production by monocytes. In a study of renal cell carcinoma lines it was shown that 
HNP 1-3 were cytotoxic to all tested cell lines when present in high concentrations (above 
25 ug/ml), but at lower concentration HNP 1-3 stimulated growth of a subset of tumour 
cell lines. We add to the established theory that HNP 1-3 are cytotoxic to mammalian cells, 
by demonstrating that HNP 1-3 purified from colon tumours are capable of lysing MDCK 
cells. Our study was based on a 30 minutes microflow study and did not allow us to 
investigate the minimum concentration of HNP 1-3 necessary for lysis. 

Conclusion 

The high concentration of HNP 1-3 observed in tumours and the observation that HNP 1-3 
are capable of lysing mammalian cells leads to the immediate conclusion that the peptides 
serve to the benefit of the host by primarily killing tumour cells. However, HNP 1-3 bind to 
HLA-Class II molecules and are capable of reducing the proliferation of a H LA- DR- restricted 
T-cell line after stimulation and could in this way help the tumour avoid immune 
recognition. Defensins also regulate the systemic immune response. Through interaction 
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with the chemokine receptor CCR6, beta-Defensins recruit dendritic cells and T cells and 
HNP 1-3 are capable of recruiting leukocytes to sites of infection in mice. Up-regulated 
immune responses are known to stimulate tumour proliferation: immune cells are actively 
recruited by tumours to exploit their pro-angiogenic and pro-metastatic effects. Whether 
5 the high concentrations of HNP 1-3 in the tumour limits the tumour growth or on the 
contrary stimulate tumour proliferation is not clarified. Recently, it was found that the 
excess amounts of HNP 1-3 observed in urine from bladder cancer patients was produced 
by the actual bladder cancer cells, (and not by tumour infiltrating neutrophils), and that 
highly invasive bladder cancer cells produced more HNP 1-3 than less invasive ones. We 
10 suggest that the prominent surface charge on Defensins, their ability to bind to high mass 
proteins and the observed excess amounts of peptides seen in tumours, could provide the 
peptides with broad antagonising effects, that may influence numerous receptors in the 
tumour microenvironment. 



15 EXAMPLE 12 IDENTIFICATION OF BIOMARKERS FOR COLORECTAL CANCER BY PLASMA 
INVESTIGATIONS 



20 



The plasma screening was performed on IMAC30 chips according to the protocol used for 
serum screening described above on IMAC30 chips, with the exception of adding 5 pi 
plasma instead of 5 pi serum to the binding buffer. 

Table 21 Possible tumour markers with the following mass values from plasma samples of 
colon cancer patients 



Up-regulated in 
cancer plasma (m/z) 


Down-regulated in 
cancer plasma (m/z) 


3895 


6435 


4136 


6635 " 


4480 


8931 " 


4977 


64860 


5266 


66300 


5905 


66500 


7469 


66800 


9950 




11723 




13747 




13870 




14030 




14100 
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14300 




14470 




19865 




19966 




60475 




60730 

• 





EXAMPLE 13 IDENTIFICATION OF BIOMARKERS FOR COLON CANCER BY DATA MINING OF 
MASS SPECTRA FROM PLASMA 



The aim of this study was to separate healthy individuals from colorectal cancer patients 
using a Principal Component Analysis (PCA) on a normalised data set from mass spectra. 



Methods 
10 Samples 

Plasma samples were obtained from 16 healthy individuals and 16 patients diagnosed with 
colon cancer and the samples were analysed on IMAC30 chips according to the protocol 
described above in Example 12. 



15 Data 

Two data sets containing m/z and intensity of the peaks identified by "biomarker wizard" 
were generated. The first data set contained half of the spectra. The second data set 
contained all spectra. Spectra were pooled and normalised based on total ion current in the 
two data sets. 



20 



Computer programs: 

Ciphergen ProteinChip Software with "biomarker wizard". 
Multi Variate Statistical Program (MVSP), Kovack Computing. 



25 Parameters 

Biomarker wizard settings: 
First pass: 5 
Min peak threshold: 0 
Cluster mass window: 0.3 
30 Second pass: 2 
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Principal Component Analysis settings (MVSP): 
Data standardised: Yes 
Data centred: Yes 



5 



Results: 

Potential markers from a principal component analysis of the first data set: 1455, 1500, 
1532, 1573, 1704, 1725, 3445, 3545, 3895, 4136, 4480, 4977, 5266, 5910, 6110, 6435, 
6635, 6673, 8931, 9015, 9173, 9950, 10838, 11723, 13747, 13870, 19865, 28028, 
10 32490, 33233, 50820, 60638, 65706, 66213, and 79155 Da. 

The following combinations of markers yielded 100 % sensitivity and 100 % specificity: 
3895, 6110, 8931, and 6635 Da. 
6110, 8931, and 6635 Da. 
19865, 13747, 8931, and 9015 Da. 
15 8931, 9015, 33233, and 13747 Da. 

19865, 13747, 8931, 9015, and 33233 Da. 

Principal component analysis on the second data set yielded the following potential 
markers: 1573, 1704, 1725, 6435, 6673, 9015, 9173, 10838, 11341, 11723, 13747, 
20 13880, 28028, and 50825 Da. 

The most prominent combination of markers was the following: 9173, 11728, and 13880 
Da with 100 % specificity and 100 % sensitivity. 

Conclusion: 

25 Principal Component Analysis can separate healthy individuals from patients with colon 
cancer using the intensity of the selected markers. 

As presented in Example 9, a peptide of mass 2364 is up-regulated in tumour tissue when 
analysed on SAX2 Chip (table 17, line 1). This peptide was purified (by RP-HPLC and 

30 peptide-gel-filtration) and subsequently identified by ESI-MS/MS. The peptide was found to 
consist of the following sequence: FLGMFLYEYARRHPDYSW (m/z 2363.7) SEQ ID NO 1. 
This sequence corresponds to a fragment of human serum albumin, demonstrating that 
human serum albumin is excessively degraded in colon tumour samples compared to 
normal colon tissue samples and thus supports the results that show that there is an 

35 abnormal degradation of serum albumin in serum from cancer patients 
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EXAMPLE 14 PROTEOLYTIC DEG RATION OF COMMON BLOOD PROTEINS AS A MARKER FOR 
CANCER 

Abnormal protease activi ty in colon cancer gecum 
5 When serum is analysed on the IMAC30 chip (as described in the procedure for the serum 
screening) two high mass proteins are found to be differentially expressed (as described in 
the results of the serum screening): a protein with m/z: 66500 is down-regulated in cancer 
serum whereas a protein with m/z: 60500 is up-regulated in cancer serum (see table 10). 

10 The protein of 66500 is human serum albumin (HSA) (ALBU_HUMAN (P02768)) 

The theoretical mass of HSA is 66472 Da, well within 0.1% of the observed mass of 66500 
Da. The peak at 66500 is an easily identifiable and prominent peak of high intensity, often 
observed in mass spectrometry analysis of biological samples and any person familiar with 
mass spectrometry would immediately identify the prominent peak at 66500 as serum 

15 albumin. 

Therefore, we show that HSA is present in lower amounts in serum from cancer patients 
than in serum from normal individuals. 

20 The protein at 60500 appears in a reverse proportional manner to HSA: in the normal 

serum where there is high amounts of HSA, there is only little amount of 60500, and in the 
cancer serum where there is relatively low amounts of HSA, there is relatively high amount 
of 60500. 

25 From this we conclude that 60500 is a degradation product of HSA, that is produced when 
a fragment of approximately 6000 Da is lost from HSA. 

HSA is produced in the liver which is not influenced by tumour growth in the colon, at least 
not at this stage in the disease, and the observation, that there is relatively more HSA in 

30 serum from normal individuals than in serum from cancer patients, can therefore not be 
explained by an altered expression of HSA by liver cells. The only meaningful explanation 
for this abnormality is altered proteolytic degradation of HSA in serum from cancer 
patients. Since the proteolytic product, in this case the HSA fragment at 60500, is also 
present in serum from normal individuals, albeit at lower amounts than in serum from 

35 cancer patients, the exact proteolytic mechanism responsible for the specific degradation 
of HSA leading to the production of 60500 is not unique to serum from cancer patients. 

Therefore, our results show direct evidence for altered proteolytic activity in cancer serum. 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/090550 



PCT/DK2004/000263 



60 



Finally, as presented in Example 9/ a peptide of mass 2364 is up-regulated in tumor tissue 
when analysed on SAX2 Chip (table 17, line 1). This peptide was purified (by RP-HPLC and 
peptide-gelfiltration) and subsequently identified by ESI-MS/MS (as described in example 
15). The peptide was found to consist of the following sequence: FLGMFLYEYARRHPDYSW 
5 (m/z 2363.7). This sequence corresponds to a fragment of human serum albumin 

demonstrating that human serum albumin is excessively degraded in colon tumor samples 
compared to normal colon tissue samples. This supports the results that show that there is 
an abnorm degradation of serum albumin in serum from cancer patients. 

10 Identification of serum/plasma m^r^r 9 8 04Q/2fin7>y ?ftni n 

By HPLC, gel purification and trypsin peptide mapping we positively identify 
28040/28025/28010 as a po lipoprotein (P02647). 

15 Results: 



Best match: 

# peptide 
matches 



AC 



ID 



Description 



0.43 



3 



CHAIN 1 : Apolipoprotein A- 
PQ2642 APA1_HUMAN_1 I. - Homo sapiens 5.27 

(Human). 



28078.62 



Peptide map: 



| user | matching 



position 



20 




peptide 



I 



THLAPYSDELR (SEQ ID " 
NO 2) 

LSPLGEEMRDR (SEQ ID 
NO 3) 



QKVEPLRAELQEGAR (SEQ 
ID NO 4) 



DLATVYVDVLKDSGR 
DYVSQFEGSALGK (SEQ 
ID NO 5) 



Apolipoprotein information: 
Function: Participates in the reverse transport of cholesterol from tissues to the liver for 

t^lT"^ Pr ° m i ° tln9 chol ^erol efflux from tissues and by acting as a cofactor for the 
lecithin cholesterol acyltransferase (LCAT). 

25 Subcellular location: Secreted. 
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Tissue specificity: Major protein of plasma HDL, also found in chylomicrons. Synthesized in 
the liver and small intestine. 

As discussed above, abnormal concentrations of common plasma/serum proteins produced 
5 by the liver will probably not be due to altered transcription/translation of the relevant 
gene, but instead a consequence of abnormal proteolytic activity. 

EXAMPLE 15 IDENTIFICATION OF PEPTIDES FROM COLON CANCER MARKERS 

The purpose of this project is to identify a number of peptides which have been found in 
10 blood serum and which are identified as markers for colon cancer. 

Analysis 

Two samples were purified, wherein one sample contained two peaks. Each sample was 
initially analysed by MALDI-TOF to establish the molecular weight of the components and 
to have an estimate on the amount of peptide present in the sample. 

15 

The peptides of interest, found during MALDI analysis, were fragmented by both MALDI- 
TOF/TOF and ESI-MS/MS analysis. 

Sample 1 (containing the 5901 Da peptide) was purified by reversed phase HPLC and each 
20 fraction was analysed by MALDI-TOF to locate the fractions containing the 5901 Da 

peptide. The fractions containing the peptide were pooled and analysed both directly by 
MS/MS analysis and further purified by ID SDS gel electrophoresis. The band at 6000 Da 
was cut out, digested with trypsin and analysed by MALDI-TOF and TOF/TOF. 

Instruments 
25 Bruker Reflex IV (MALDI-TOF) 

Bruker Ultraflex (MALDI-TOF/TOF) 

Micromass Ultima (nanoLC-MS/MS) 

Applied Biosystems Vision Workstation (HPLC) 

30 Results 

Human serum sample (300 \x\) was purified by reversed phase HPLC. The three fractions 
containing the 5900 Da peptide were pooled and analysed by MALDI-TOF. The final fraction 
contains 4 major peaks; MH + at 4961.8 Da, 5333.5 Da, 5901.1 Da and 6187.05 Da. 

35 The pooled fractions were dried down and loaded on a SDS PAGE gel. The gel band of 
interest was cut out of the gel, reduced and alkylated, and digested with trypsin. 
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The digest sample was micro-purified over a graphite/carbon column. A peptide fingerprint 
was made. One peptide (MH + 1190.5) was selected for MALDI-TOF/TOF analysis. Database 
search of the fragmented peptide gave a Mascot search score of 69 and an ion score of 47. 
5 The peptide is part of alpha-fibrinogen. 

The sequence from gi|1706799|sp|P0267l was used to search for the masses found in the 
pooled fraction. The m/z 5901.9 Da peptide can be a part of alpha-fibrinogen, and the 
tryptic peptide (MH + 1190,5) can be included in the m/z 5901.9 Da peptide. The sequence 
10 is: 



15 



Peptide Sequences 
5901.9 Peptide 

SSSYS KQFTSSTSYNR GDSTFESKSYKMADEAGSEADHEGTHSTKRG HAKSRPV (SEQ ID NO 6) 

The bold underlined part of the sequence shows the tryptic peptide (MH + 1190.5 Da). 

The tryptic peptide does unfortunately also fit to the masses 5333.5 and 6187.05 Da found 
in the fraction. 



20 



S333.S 

GIFTNTKESSS H H PGIAEFPS RG KSSS YS ICQFTSST§Y_NBG DSTFESKS (SEQ ID NO 7) 



or 



25 



SGIFTNTKESSSHHPGIAEFPSRGKSSSYSKQFTSSISXMEGDSTFESK (SEQ ID NO 8) 
6187. OS Peptide 

GSESGIFTNTKESSSHHPGIAEFPSRGKSSSYS KOFTSSTSYNR GnciTF ESKSYKMA (SEQ ID NO 
9) 

Conclusion 

30 One peptide was found after digest of the gel band containing the "5900 Da peptide". 
Fragmentation of the peptide (MH + 1190.5) by MALDI-TOF/TOF gave the sequence 
(QFTSSTSYNR). This Is part of alpha-fibrinogen. Searching the alpha-fibrinogen sequence 
for the mass m/z 5901.9 gave a hit where the sequence from the tryptic peptide also is 
included. The sequence does also fit to the masses 5333.5 and 6187.05, respectively. 

35 Sample 2 
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Results: 

From the MALDI-TOF spectra, the peaks m/z 2363.05 and 1686.84 Da were found to 
corresponds to the masses from the SELDI approach. 

It was only possible to make ESI-MS/MS on 2363.05. The peptide was seen a triply 
charged ion. Attempts were also made to make MALDI-TOF/TOF on these peaks, but 
without success. 



Peptide Sequences 



10 De-Novo sequencing gave the tag: FLGMFI YF m ^ , n) Tnis was searcned as a 
sequence tag together with the mass MH 3+ 788.3 Da. This matched the peptide 

FLGMFLYEYAPRHP DYSW (*Fn ID NO 1 1 V 

A similarity search of the sequence shown in table 22, resulted in the following hit: 
15 Table 22 Similarity search for the 6187.05 peptide 




Human Serum Albumin In A Complex 



Conclusion 

Direct analysis of the sample with MALDI-TOF showed the peptides of interest. ESI-MS/MS 
was only possible on mass MH 3+ 788.3 Da. This matched the peptide: 
20 FLG M FLYEYARRH PDYS VV. This can be a part of alpha-fetoprotein/human serum albumin. 

Figure 13 presents the observed pattern of peptides in the region form 1900 to 2500 Da, 
the present inventors propose that the possible markers of values 1945, 2210, 2230, 2250 
and 2275 Da are somehow related. The pattern could indicate: 



25 



30 



1) Fragmentation of a larger protein 

2) Ligand binding peptides 

3) Proteolytic processing of peptides 

4) Translational / transcriptional regulation of peptides. 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/090550 PCT/DK2004/000263 

64 

EXAMPLE. 16 A METHOD FOR DISCRIMINATING BETWEEN HEALTHY INDIVIDUALS AND 
PATIENTS WITH COLON CANCER 



The aim of the study was to determine if visual inspection of mass spectra is a method for 
5 discriminating between healthy individuals and patients with colon cancer. 



Computer programs: 

Ciphergen ProteinChip Software 
Excel 



10 



Data: 

Serum samples from 47 healthy individuals and 24 patients diagnosed with colon cancer 
were assayed on IMAC30 chips and analysed as described above. 
Intensities were normalised based on total ion current. 



15 



Method: 

Raw data was exported from Ciphergen ProteinChip Software to Excel, mean and standard 
error of means (SEM) was calculated for each m/z value. 

Mean and SEM was imported in R. Plots for specific regions of the spectra were designed. 
20 The specific regions were chosen to include the 5 markers used for Principal component 
analysis as described above. 

Results: 

Figure 20 A-E shows the average intensity spectra of healthy individuals (solid) and 
25 patients diagnosed with colon cancer (dashed). The standard errors of means (SEM) are 
shown with bars. 

A: The area from 3900 to 4100 Da, SEM shown for 3960 and 3980 Da. B: The area from 
5200 to 5400 Da, SEM shown for 5340 and 5350 Da. C: The area from 5800 to 6000 Da, 
SEM shown for 5906 and 5920 Da. D: The area from 6800 to 7000 Da, SEM shown for 
30 6880 and 6940 Da. E: The area from 27000 to 29000 Da, SEM shown for 28025 Da. 

Conclusion 

Visual inspection of specific regions can be used for discriminating healthy individuals from 
patients with colon cancer. 

35 
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EXAMPLE 17 POSSIBLE IDENTIFICATION OF SERUM MARKERS BY DIRECT MASS 
SEARCH. 



The aim of this study was to search a database for proteins with known mass corresponding to 
5 the measured mass value of the markers identified. This may constitute a possible 
identification. 

Methods 

The measured mass value is analysed on the "Tagldent Tool" on the ExPASy server. 
10 With the following parameters: 

Mass value range: 0.2% 

pI: Complete range 

Organism: Human 

Database(s) on which the scan should be performed: Swiss-Prot 

15 



Results 

By searching the database with the mass value of each of the markers a number of possible 
identifications occur (hits): 



20 

Table 22 Up-regulated serum markers 



Marker 11900, up-regulated on H50 


Gene symbol 


Accession No. 


Annotation 


ANFB_HUMAN 


(P16860) 


Gamma-brain natriuretic peptide. 


ICE2_HUMAN 


(P42575) 


Caspase-2 subunit pl2. 


ICE3_HUMAIM 


(P42574) 


Apopain pl2 subunit. 


ICEA_HUMAN 


(Q92851) 


Caspase-10 subunit pl2. 


LRP2__HHV1F 


(P17589) 


Latency-related protein 2. 


VE4_HPV41 


(P27553) 


Probable E4 protein. 


VE7_HPV56 


(P36833) 


E7 protein. 


VE7_HPV66 


(Q80956) 


E7 protein. 


YG49_HUMAN 


(Q9BY77) 


Hypothetical protein KIAA1649 




Marker 11700, up-regulated on H50 


Gene symbol 


Accession No. 


Annotation 


GPA2_HUMAN 


(Q96T91) 


Glycoprotein hormone alpha 2. 


LSM3_HUMAN 


(Q9Y4Z1) 


U6 snRNA-associated Sm-like protein LSm3 (MDS017). 


MIR2_HUMAN 


(Q9Y6H6) 


Potassium voltage-gated channel subfamily E member 3 


NRTIM_HUMAN 


(Q99748) 


Neurturin. 
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S103_HUMAN 


(P33764) 


^xuw taiuuiii Dmoing protein aj ^o-iuue protein). 


SAA_HUMAN 


(P02735) 


Serum amvloid A nrnfpin 


ULA9_HCMVA 


(P16738) 


Hvoothetiral nmt-oin ill mo 
• ijrpuu i^iitai protein unuy. 


VE7_HPV05 


(P06932) 


E7 d rote in 


VE7_HPV5B 


(P26559) 


E7 Drotein 


E311_ADE02 


(P24935) 


• / 1 — J.J..U kucj glycoprotein. 


FKBB_HUMAN 


(Q16645) 


FK506~binriin/i nrnfoin 1 n 


GLRX^HU MAN 


(P35754) 


vjiuioicuuaiii ^ i j uuiLransrerasej ^i iase). 


RU\2_HUMAN 


(P05387) 


auuiL i luvjiuiiiai protein k^£. 


S114_HUMAN 


(Q9HCY8) 


S100 calcium-binding protein A14 (S114). 


SM3 INHUMAN 


(P55854) 


Ubiquitin-like protein SMT3A. 


TAT_HV1MN 


(P05905) 


TAT protein (Transactivating regulatory protein). 


VE7_HPV08 


(P06430) 


E7 protein. 


Yllb_ADE02 |(P03287) 


Hypothetical 11.6 kDa early protein 




Marker 11550, u) 


D-regulated on H50 


Gene symbol 


Accession No. 


Annotation 


CF53_HUMAN 


(Q9P0S9) 


Protein C6orf53 (Protein HSPC194). 


HMGI__HUMAN 


(P17096) 


High mobility group protein 


INI7_HUMAN 


(P40305) 


Interferon-alpha induced 11.5 kDa protein (p27). 


K413_HUMAN 


(Q9BYU7) 


Keratin associated protein 


KV1W_HUMAN 


(P04431) 


Ig kappa chain V-I region Walker precursor 


TAT_HV1A2 


(P04614) 


TAT protein (Transactivating regulatory protein). 


TATJHVIOY 


(P20893) 


TAT protein (Transactivating regulatory protein). 


TAT_HV1RH 


(P05908) 


TAT protein (Transactivating regulatory protein). 


ULB1_HCMVA 


(P16831) 


Hypothetical protein UL111. ~ 


VE7_HPV19 


(P36822) 


E7 protein. 


VE7_HPV21 


(P50779) 


E7 protein. 


VE7_HPV47 


(P22423) 


E7 protein. 


VPR_HV1A2 


(P05952) 


VPR protein (R ORF protein). 


Y115__ADE07 


(P03288) 


Hypothetical 11.5 kDa early protein. 




Marker 11500, up 


•-regulated on H50 


Gene symbol 


Accession No. 


Annotation 


LV1G_HUMAN 


(P06316) 


Ig lambda chain V-I region BL2. 


PRP1_HUMAN 


(P04280) 


Salivary proline-rich protein precursor 


RLA1_HUMAN 


(P05386) 


60S acidic ribosomal protein PI. 


RT16_HUMAN 


(Q9Y3D3) ; 


28S ribosomal protein S16. 


S11Y_HUMAN 


(Q9UDP3) 1 


Putative S100 calcium-binding protein H_JMH0456N16.1. 
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TAT_HV1JR 


(P20879) 


imi procein i i ransactivatmg regulatory protein). 


TAT_HV1S1 


(P19553) 


TAT protein (Transactivating regulatory protein) 


TAT_HV1S3 


(P19552) 


TAT protein (Transactivating regulatory protein). 


VE7JHPV12 


(P36819) 


E7 protein. 1 


VE7_HPV25 


(P36823) 


E7 protein. 


VPR3_HUMAN 


(Q9UKI3) 


Pre-B lymphocyte protein 3. 1 


VPRJHVIOY 


(P20891) 


VPR protein (R ORF protein). 



Marker 15200, up-regulated on CM10 



Gene symbol 



CYB5_HUMAN 
ENR1J-1UMAN 
H33_HUMAN 



H3B_HUMAN 



LSM1__HUMAN 



SSBJHUMAN 



Accession No. 



(P00167) 
(Q14264) 
(P06351) 



(Q93081) 



(015116) 



(Q04837) 



Annotation 



Cytochrome fc>5. 

Transmembrane protein (By similarity). 
Histone H3.3 
Histone H3/b. 



U6 snRNA-associated Sm-like protein LSml 



Single-stranded DNA-binding protein. 



Marker 6125, up-regulated on CM10 

Accession No. 



Gene symbol 



MT1AJ-IUMAN 
MT1BJHUMAN 



(P04731) 
(P07438) 



Annotation 
Metallothionein-IA (MT-1A). 
Metallothionein-IB (MT-1B). 



Marker 5900, up-regulated on CM10 



Gene symbol 
A4_HUMAN 



Accession No. 



(P05067) 



Annotation 



Gamma-CTF(50) (By similarity). 



Marker 33000, up-regulated on SAX2 

Annotation 



Gene symbol 



ADT1_HUMAN 

CAMG_HUMAN 

DSR3_HUMAN 



LECH_HUMAN 



MC33_HUMAN 



MCAT_HUMAN 



MDHM_HUMAN 
MIOXJHUMAN 



MSLN_HUMAN 



PCTL_HUMAN 



Accession No. 



(P12235) 
(P49069) 
(014972) 



(P07306) 



(Q14805) 



(043772) 



(P40926) 
(Q9UGB7) 



(Q13421) 



R1AB_CVH22 



(Q9Y365) 



(Q05002) 



ADP,ATP carrier protein 
Calcium-signa l modulating cyclophilin ligand (CAML), 
Down syndrome critical region protein 3 



Asialoglycoprotein receptor 1 



Metaphase chromosomal protein 1 



Mitochondrial carnitine/acylcarnitine carrier protein 



Malate dehydrogenase. 
Inositol oxygenase 
Mesothelin. 



PCTP-like protein 



Replicase polyprotein lab 



SUBSTITUTE SHEET (RULE 26) 



WO 2004/090550 



PCT/DK2004/000263 



68 



! D 1 A n r\/u C A 
I KlAD_LVnbA 


(P59641) 


NSP3 (By similarity). I 


IrFM Mi IMAM 




G i P-binding protein REM 


\*ZCZC7 Ml IMAM 

{ n IJ l v IMIM 




Zeta-sarcoglycan (Zeta-SG) (ZSG1). | 


I^TTIA Ml IMAM 
I «-> I In il U l v IMIM 




Syntaxin 1A (Neuron-specific antigen HPC-1). 


iT9PR miimam 




Transcription initiation factor HE, beta subunit 


Itmtm miimam 
ii n i i v i nui v iMiM 




3-mercaptopyruvate sulfurtransferase (EC 2.8.1.2) MST | 


lllPPI MIIMAM 


/DO C07yl \ 

^rZ3o/4) 


Mitochondrial Drown fat uncoupling protein 1 (UCP 1) | 


Il II H*7 MM\/1 1 

ulu / nnvjii 


(K1U191) 


Protein ULA 


Il il n~? uu\nn ~~ 
ULU/ nrivzri 


(r 89430) 


Protein UL7. " " " 


l\/Fil HD\/47 




Probable E4 protein. 


VP1Q H(~M\/A 




Capsid protein VP19C 


|f||Q7 MIIMAM 




Hypothetical protein C21orf87. 


IfntnRI Mt IMAM 
VJV3P1 .nUrlMlN 




G antigen family B 1 protein 


GGD2_HUMAN 


(Q9HD64) 


G antigen family D 2 protein 


ID1_HUMAN 


(P41134) 


DNA-binding protein inhibitor ID-1 ' * 


POLG_HRV16 


(Q82122) 


Core protein p2A. 


POLG_HRV89 


(P07210) 


Core protein p2A. " ' 


PP13_HUMAN 


(Q9UHV8) 


(Placenta protein 13) 




Marker 15935, U[ 


^regulated on SAX2 


[Gene symbol 


Accession No. 


Annotation 


CAL5_HUMAN 


(Q9N2T1) 


Calmodulin-like protein 5 


COAC.HUMAN 


(Q14019) 


Coactosin-like protein. 


GML_HUMAN 


(Q99445) 


Glycosyl-phosphatidylinositol-anchored molecule-like | 


HBD_HUMAN 


(P02042) 


Hemoglobin delta chain. 


HPT.HUMAN 


(P00738) 


Haptoglobin alpha chain. " ~\ 


IR09_HCMVA 


(P16807) 


Hypothetical protein IRL9 (TRL9). ~~ ( 


M46E_HUMAN 


(Q96DS6) 


Membrane-spanning 4-domains subfamily A member 6E. | 


RS19_HUMAN 


(P39019) 


40S ribosomal protein S19 


SJ2B_HUMAN 


(P57105) 


Synaptojanin 2 binding protein 


ULC6_HCMVA 


(P16836) 


Hypothetical protein UL126. 




Marker 15200, up 


-regulated on SAX2 j 


jGene symbol 7 


Accession No. , 


Annotation 


CYB5_HUMAN 


(P00167) 


Cytochrome b5. ~~ 


ENR1_HUMAN 


(Q14264) 


Transmembrane protein (By similarity). 


H33.JHUMAN 


(P06351) 


Histone H3.3 


H3B_HUMAN | 


(Q93081) 


Histone H3/b. 


LSM1_HUMAN ] 


(015116) I 


U6 snRNA-associated Sm-like protein LSml | 
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SSB_HUMAN [(QQ4837) Single-stranded DMA-binding protein. 



Marker 60500 u 


p-regulated on IMAC30 * 


Gene svmbol 


Accession No. 


Annotation 


A1AD HUMAN 


(P25100) 


Alpha-ID adrenergic receptor 


CBS HUMAN 


(P35520) 


Cystathionine beta-synthase 


CDY1 HUMAN 


(Q9Y6F8) 


Testis-specific chromodomain protein Y 1. 


CDY2 HUMAN 


CQ9Y6F7) 


Testis-specific chromodomain protein Y 2. 


ELS HUMAN 


(P15502) 


Elastin precursor (Tropoelastin). 


EST1 HUMAN 


(P23141) 


Liver carboxylesterase. 


FIB1 ADE41 


(P14267) 


Fiber protein 1. " ~ 


GKP2 HUMAN 

V3i\r a. i iui I/-MM 


(Q14410) 


Glycerol kinase, testis specific 2 


GKP? HUMAN 


(Q14409) 


Glycerol kinase, testis specific 1 


N4R"3 HI IMAM 


(015049) 


Nedd4-binding protein 3 (N4BP3). 


SMA4_HUMAN 


(Q13485) 


(SMAD 4) 


SUW1JHUMAN 


(P59817) 


Suppressor of hairy wing homolog 1 (3'OYll.l). | 


TCPG^HUMAN 


(P49368) 


(TCP-l-gamma) (CCT-gamma). j 


THAS_HUMAN 


(P24557) 


Thromboxane-A synthase 


TTC8_HUMAN 


(Q8TAM2) 


Tetratricopeptide repeat protein 8 


Y469_HUMAN 


(Q9UJP4) 


Hypothetical protein KIAA0469. 


Z306_HUMAN 


(Q9BRR0) 


Zinc finger protein 306 


Z479JHUMAN 


(Q96JC4) 


Zinc finger protein Krl9) (HKrl9). 



Marker 19900, up-regulated on IMAC30 



Gene symbol 



AMEX_HUMAN 
CIT1_HUMAN 



CLE1_HUMAN 
CRAAJHUMAN 



FRIL_HUMAN 
GILTJ-IUMAN 



KR45_HUMAN 



RB8A_HUMAN 
TD52_HUMAN 



YAF2_JHUMAN 



Accession No. 



Q99217) 
Q99966) 



075596) 
P02489) 



P02792) 
P13284) 



Q9BYR2) 



Q8IY57) 



Annotation 

Ameiogenin, X isoform 
Cbp/p300-interacting transactivator 1 



C-type lectin superfamily member 1. 
Alpha crystallin A chain. 



Ferritin light chain (Ferritin L subunit). 
(Gamma-interferon-inducible protein IP-30), 



Keratin associated protein 4-5 




YYl-associated factor 2. 



Marker 11080, up-regulated on IMAC30 



Gene symbol Accession No 



Annotation 
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IDSJ-iUMAN 



S11CLHUMAN 



TAT_HV1EL 
VE7_HPV65 



(P22304) 



(P08206) 



(P04611) 
(Q07859) 



Iduronate 2-sulfatase 14 kDa chain. 



Calpactin I light chain 
TAT protein 
E7 protein. 



Marker 10830, uj 


p-regulated on IMAC30 — " — 


(Gene symbol 


Accession No. 


Annotation 


LSM2_HUMAN 


(Q9Y333) 


U6 snRNA-associated Sm-like protein LSm2 


LST1_HUMAN 


(000453) 


Leukocyte specific transcript 1 protein 


POLGJHE701 


(P32537) 


Core protein p2B. " 


POLJHVIND 


(P18802) 


Protease. " ' 


POLJ-IVIOY 


(P20892) 


Protease. — 


POL_HV2BE 


(P18096) 


Protease. ™ -J 


S108_HUMAN 


(P05109) 


Calgranulin A 


VE7_HPV33 


(P06429) 


E7 protein. 


VE7_HPV58 


(P26557) 


E7 protein. — — - _ i 



Marker 9140, up-regulated on IMAC30 



Gene symbol 
LSM6_HUMAN 
SAP_HUMAN 



VPU_HV1LW 



Accession No. 
(Q9Y4Y8) 
(P07602) 



(Q70625) 



Annotation 

U6 snRNA-a ssociated Sm-like protein LSm6 
Saposin D. 

VPU protein (U ORF protein). 



I Gene symbol 
APC2_HUMAN 



Marker 8930, up-regulated on IMAC30 

Annotation 
Apolipoprotein C-IL 



!IL8_HUMAN 



PLMNJHUMAN 



|SLUR_HUMAN 
SRG1J-IUMAN 
SY08_HUMAN 
VGLF_PI2H 
VGLF_PI2HG 



Accession No. 
(P02655) 



(P10145) 



(P00747) 



(P55000) 



VGLF_PI2HT 



(075711) 
(P80075) 
(P25467) 
(P27286) 



(P26629) 



Interleukin-8. 
Activation peptide. 



Secreted Ly-6/uPAR related protein 1 
Scrapie-responsive protein 1. 
Small inducible cytokine A8. 
Fusion glycoprotein F2. 
Fusion glycoprotein F2. 



Fusion glycoprotein F2. 



Marker 6110 / up-regulated on IMAC30 
[Gene symbol (Accession No. (Annotation 
MT1B_HUMAN 
PPLA_HUMAN 



(P07438) 
(P26678) 



Metallothionein-IB (MT-1B). 
Cardiac phospholamban (PLB). 
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WFAB_HUMAN 


(Q8IUB3) 


Marker fiOQH nn- 





Protein WFDC10B. 




Accession No. 

Metallothionein-IF (MT-1F) (HQP0376). 



Marker 5920, up-regulated on IMAC30 



Gene symbol 
A4 HUMAN 



Accession No. 



(P05067) 



Annotation 



Gamma-CTF(50) (By similarity). 



Marker 5900, up-regulated on IMAC30 



Gene symbol 
A4_HUMAN 



GAGJHV1A2 



Accession No. 



(P05067) 



(P03349) 



Annotation 



Gamma-CTF(50) (By similarity). 



Core protein p6. 



Marker 5330, up-regulated on IMAC30 

Accession No 



Gene symbol 



TISR_HUMAN 



(Q9Y5M6) 



Annotation 



Oculomedin 



Table 23 D own- regulated serum markers 
Marker 46000, down-regulated on H50 

Accession No. 



Gene symbol 



AB3BJHUMAN 
B3G7_HUMAN 



BTB6_HUMAN 



CRF2_HUMAN 



EGL1_HUMAN 



FXF2_HUMAN 
GPT_HUMAN 



K1HA_HUMAN 



MCR1_HUMAN 



(Q9UH17) 
(Q9NY97) 



(Q96KE9) 



(Q13324) 



(Q9GZT9) 
(Q12947) 
(Q9H3H5) 



Annotation 



Phorbolin 1-related protein) 
Beta-l,3-galactosyltransferase 7 



BTB/POZ domain containing protein 6 



(076009) 



NCAP_CVHSA 



PRD7_HUMAN 
RL3_HUMAN 
S143_HUMAN 
SSXT_HUMAN 



TDG_HUMAN 



(Q99705) 
(P59595) 



(Q9NQW5) 



(P39023) 

(Q9UDX4) 

(Q15532) 



(Q13569) 



Corticotropin releas ing factor receptor 2 precursor 
Egl nine homolog 1 
Forkhead box protein F2 
(EC 2.7.8.15) (GPT 



Keratin, type I cuticular HA3-I 



Melanin-concentrating hormone receptor 1 



Nucfeocapsid protein 



(Oligosaccharyl transferase 48 kDa subunit) 
Phosphatidylinositol-4-phosphate 5-kinase type III 



PR-domain zinc finger protein 7. 



60S ribosomal protein L3 
SEC14-like protein 3 
SSXT protein 



G/T mismatch-specific thymine DNA glycosylase 
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TR1BJ-IUMAN 
Z193_HUMAN 
!Z514_HUMAN 



Gene symbol 
AAAD__HUMAN 



BHB2_HUMAN 
CL02_HUMAN 



COT2_HUMAN 
CV05_HUMAN 
CXA7__HUMAN 
DEMA_HUMAN 



DOK2_HUMAN 



FUT4JHUMAN 
GAG2_HUMAN 
IL5RJ-RJMAN 



MKK2_HUMAN 
NTR2_HUMAN 
ODBAJ-IUMAN 
PCOIJHUMAN 
PLA1_HUMAN 



PREB_HUMAN 
PSD6_HUMAN 



Accession No. [Annotation 
(P22760) Arylacetamide deacetylase 



(014503) 
(Q8NHQ8) 



[Class B bas ic helix-loop-helix protein 2 
Protein C12orf2 



(P24468) 
(Q9Y519) 
(P36383) 
(Q08495) 



COUP transcription factor 2 
[Putative MAP k inase activating protein 
[Gap junction alpha-7 protein 
iDematin 



(060496) 




(P22083) 
(P10264) 
(Q01344) 



(P49137) 
(095665) 
(P12694) 
(Q15113) 

. 4 

(Q9HB21) 



Fucosy transferase 4 
[HERV-K10 putative GAG polyprotein 2. 
jlnterleukin-5 rece ptor alpha chain precursor 
[MAP k inase-activated protein kinase 2 
[Neurotensin receptor type- 2 
[2-oxoiso valerat e dehydrogenase alpha subunit, 
Procollagen C-pro teinase enhancer protein precursor 

Pleckstrin homology domain-containing protein family A 
member 1 



RHCEJHUMAN 



RT29_HUMAN 




TC10_HUMAN 



TCO!_HUMAN 



TC02J-IUMAN 



VE2_HPV50 



VE2_HPV63 



VE2_HPV65 



VRK1_HUMAN 



WDR4_HUMAN 



(Q9HCU5) 
(Q15008) 
(P18577) 



(Q12799) 



P20061) 



(P20062) 




(Q80930) 



(Q07850) 



(Q07851) 



(Q99986) 



(P57081) 



Prolactin regula tory element-binding protein. 
26S proteaso me non-ATPase regulatory subunit 6 
Blood group Rh(CE) polypeptide 



T-complex protein 10A homolog. 



Transcobalamin L 



Transcobalamin II. 



Regulatory protein E2. 



Regulatory protein E2. 



Regulatory protein E2. 



Serine/threonine protein kinase VRK1 



WD-repeat protein 4. 
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Marker 8940, down-regulated on H50 



Gene symbol 


Accession No. 


Annotation 


SLUR_HUMAN 


(P55000) 


Secreted Ly-6/uPAR related protein 1. 


SRG INHUMAN 


(075711) 


Sera pie-responsive protein 1. 


SY07JHUMAN 


(P80098) 


Small inducible cytokine A7. 


VE5_HPV58 


(P26552) 


Probable E5 protein. 




Marker 8230, doi 


/vn-regulated on H50 


Gene symbol 


Accession No. 


Annotation 


PSCAJHUMAN 


(043653) 


Prostate stem cell antigen. 


UGR2_HUMAN 


(Q96QR1) 


Uteroglobin-related protein 2. 


ULD1_HCMVA 


(PI 6773) 


Hypothetical protein UL131. 




Marker 6650, do^ 


wn-regulated on H50 


Gene symbol 


Accession No. 


Annotation 


68MP_HUMAN 


(P56378) 


6.8 kDa mitochondrial proteolipid 


A4JHUMAN 


(P05067) 


Gamma-CTF(57). 


CCKINMHUMAN 


(P06307) 


Cholecystokinin CCK58. \ 


NRG4_HUMAN 


(Q8WWG1) 


Neuregulin-4. 


PARTJHUMAN 


(Q9NPD0) 


Prostate-specific and androgen regulated protein PART-Tl 


PE19_HUMAN 


(P48539) 


Brain-specific polypeptide PEP-19 


RS30JHUMAN 


(Q05472) 


40S ribosomal protein S30. 




Marker 6450, dov 


vn-regulated on H50 " 


Gene symbol 


Accession No. 


Annotation | 


3CL_HUMAN 


(Q13412) 


Pre-T/NK cell associated protein 3CL 


E306_ADE35 


(P17591) 


Early E3 6.4 kDa protein. 


GAG_HV1A2 


(P03349) 


Core protein p7. 


GAG_HV1B1 


(P03347) 


Core protein p7. 


GAG_HV1JR 


(P20873) 


Core protein p7. 


GAG_HV1MN 


(P05888) 


Core protein p7. 


GAGJ-IV1PV 


(P03350) 


Core protein p7. 


GLPE_HUMAN 


(P15421) 


Glycophorin E. 



Marker 1536, down-regulated on CM10 



Gene symbol 
CCKN_HUMAN 
FIBAJHUMAN 



Accession No. 
(P06307) 
(P02671) 



Annotation 
Cholecystokinin CCK12. 
Fibrinopeptide A. 
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Marker 66500, d 


lown-regulated on IMAC30 " ' " — 


[Gene symbol 




Annotation 


AFAM_HUMAN 




Mramin. 


ALBUJHUMAN 




oerum albumin. 


AN21_HUMAN 




m A Jft u k .—^k - . to ft b ^ ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 

Ankynn repeat domain protein 21 


BRL1__EBV 




Transcription activator BRLF1. 


CALIJ-IUMAN 




call cm. 


CD93JHUMAN 




Complement component Clq receptor. 


CDYL_HUMAN 




cnromodomam Y-like protein 


FETAJHUMAN 


( POP771 ^ 


Alpha-fetoprotein precursor 


FPGT_HUMAN 


/01 4777"\ 


rucose-i-pnosphate guanylyltransferase 


FUT8_HUMAN 


W"DTCDJ 


Alpha-( l,6)-fucosyltransferase 


GBP5_HUMAN 




Interferon-inaucea guanylate-binding protein 5 


GDSIJHUMAN 


^rDOUD ) 


Rapl GTPase-GDP dissociation stimulator 1 


GRK4_HUMAN 




G protein-coupled receptor kinase 


MM09_HUMAN 


(P14780) 


type IV collagenase. 


MOT8_HUMAN 


(P36021) 


Monocarboxylate transporter 8 " 


NR42_HUMAN 


(P43354) 


Orphan nuclear receptor NURR1 


SNX9J-IUMAN 


(Q9Y5X1) 


Sorting nexin 9 "~ 


STB2JHUMAN 


(Q15833) 


Syntaxin binding protein 2 


VP40_HHV11 


(P10210) 


Gene UL26 protein. 


VU47_HHV6U 


(Q06093) 


Glycoprotein U47. " " — 




marker 44JUU, dc 


>wn-regulated on IMAC30 H 


iGene symbol 


Accession No. 


Annotation 


A1ATJ-IUMAN 


(P01009) 


Alpha-l-antitrypsin. 


ABA2_HUMAN 


(Q96P71) 


Amyloid beta A4 protein-binding family A " | 


APL3_HUMAN 


(095236) 


Apolipoprotein L3 


CEA2_HUMAN 


(Q9NPF8) 


Centaurin alpha 2. " 


CK16_HUMAN 


(Q9NQ32) 


Protein Cllorfl6. 


D3DR_HUMAN 


(P35462) 


D(3) dopamine receptor. "~ " \ 


DCT2_HUMAN 


(Q13561) 


Dynactin complex 50 kDa subunit 


ELK3_HUMAN i 


(P41970) | 


ETS-domain protein Elk-3 


GATM_HUMAN I 


(P50440) ( 


Glycine amidinotransferase 


GBAF_HUMAN < 


[P38405) < 


guanine nucleotide-binding protein G(olf) 


HXB3_HUMAN j 


[P14651) I 


-lomeobox protein Hox-B3 " 


KLFC_HUMAN ( 


;Q9Y4X4) 1 


<rueppel-like factor 12 


LHX2_HUMAN 


;P50458) 


JM/homeobox protein Lhx2 " 
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MM11__HUMAN 


(P24347) 


Stromplvciri-"} ' 1 

vyi i icty oil I j » 


MPK4_HUMAN 


(P45985) 


■ mi laic KlildSG 


OMGP_HUMAN 


(P23515) 


^iiyuuciiurocyte-rnyeiin glycoprotein. 


P2X3J-IUMAN 


(P56373) 




PSG3_HUMAN 


(Q16557) 


' icyjianLy ^jjcciric Deta- l-giycoprotein 3 


RUN3_HUMAN 


(Q13761) 

X X» ^ 


i 1 1 1_ iciaicu LI al ibLiI ptlOn FaCtOr J 


S3B4JHUMAN 


(Q15427) 


Solicinn f?»rt"nr ciihunif vl " — ■ 


SB13JHUMAN 


(Q9UIV8) 


Hurnin 

1 IUI LJIt I 


SUT3_HUMAN 


(075486) 


1 1 ai iptiun initiation protein 5>PT3 homolog 


TE2I_HUMAN 


(Q9NYB0) 


ICIU, " CML repeat omaing factor 2 interacting protein 1 


TFT1_HUMAN 


(Q9NNX1) 


Tuftelin " ~ — ~ 


TDl IA 1— I 1 imam 


(Q9Y606) 


tRNA pseudouridine synthase A 


UL61_HCMVA 


(P16818) 


Hypothetical protein UL61. 


VE2_HPV03 


(P36778) 


Regulatory protein E2. 


VE2JHPV29 


(P50772) 


Regulatory protein E2. 


VE2_HPV41 


(P27552) 


Regulatory protein E2. " 


VU3JHHV7J 


(P52520) 


U3 protein. 



Marker 28121, d 


own-regulated on IMAC30 ~~ " 


iGene symbol 


Accession No. 


Annotation " ~~ ~~ 


143FJ-IUMAN 


(Q04917) 


14-3-3 protein eta (Protein AS1). 


143GJHUMAN 


(P35214) 


14-3-3 protein gamma 


ABME_HUMAN 


(P41238) 


Apolipoprotein B 


APA1_HUMAN 


(P02647) 


Apolipoprotein A-I precursor (Apo-AI). 


CCG6J-IUMAN 


(Q9BXT2) 


calcium channel gamma-6 subunit 


CDX1_HUMAN 


(P47902) 


Homeobox protein CDX-1 


CN G6__H U MAN 


(Q9Y224) 


Protein C14orfl66 (CGI-99). 


CTX3„HUMAN 


(Q9UJQ1) 


Protein C20orfl03 precursor. 


DRN2_HUMAN 


(000115) 


Deoxynbonuclease II precursor 


E1A_ADE04 


(P10407) 


Early E1A 28 kDa protein. " " 


EP34_HCMVA 


(P16768) 


Early phosphoprotein P34. 


FA7_HUMAN 


(P08709) 


Factor VII heavy chain. 


K247_HUMAN 


(Q92537) 


Protein KIAA0247 precursor. 


M4AQ_HUMAN 


(Q9NXJ0) 


Membrane-spanning 4-domains subfamily A member 12. 


MIP_HUMAN 


(P30301) 


Lens fiber major intrinsic protein 


MLF2J-IUMAN 


(Q15773) 


Myeloid leukemia factor 2 


ORC6_HUMAN 


(Q9Y5N6) 


Origin recognition complex subunit 6. 


PMM2_HUMAN 


(O15305) 


Phosphomannomutase 2 


PRPK_.HU MAN 


(Q96S44) 


p53-related protein kinase 
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RFXK_HUMAN 
iSTXA HUMAN 



TPAJHUMAN 
IWBP2 HUMAN 



(014593) 



(060499) 



(P00750) 
(Q969T9) 



DNA-binding protein RFXANK 
Syntaxin 10 (SynlO). 



Tissue-type plasm inogen activator chain 
WW domain binding protein 2 



Marker 28010, down-regulated on IMAC30 



Gene symbol 



2DOB_HUMAN 
CATW HUMAN 



CRAR HUMAN 



DB83JHUMAN 
|DGK_HUMAN 
GS2_HUMAN 



HXB9_HUMAN 
IF28_HUMAN 
M OX INHUMAN 
SHP__HUMAN 



SPREJHUMAN 



T4S8 HUMAN 



VP40 HCMVA 



Accession No. 



(P13765) 
(P56202) 



(P48740) 

(P57088) 
(Q16854) 
(P41247) 



(P17482) 
(Q96DX8) 
(P50221) 
(Q15466) 



(P35270) 



(O60637) 



(P16753) 



Annotation 



HLA class II histocompatibility antigen 
Cathepsin W 



Complement-activating component of Ra-reactive factor 
precursor 

DB83 protein. 

Deoxyguanosine kinase. 

GS2 protein (DXS1283E). 



Homeobox protein Hox-B9 
28 kDa interferon responsive protein. 
Homeobox protein MOX-1 
Orphan nuclear receptor SHP 



Sepiapterin reductase 



Transmembrane 4 superfamily 



Assemblin. 



Marker 28315, down- regulated on IMAC30 



Gene symbol 
AQP5JHUMAN 
BA29_HUMAN 



C151_HUMAN 



CBX7_HUMAN 



CHOD_HUMAN 



CSSl^HUMAN 



CU02_HUMAN 



ECHM_HUMAN 



EMX2 HUMAN 



IFE3_HUMAN 



NS3BJHUMAN 

POLG_EC22H 

'PSA3_HUMAN 



THAA_HUMAN 



UNG_HCMVA 



Accession No. 
P55064) 
Q9UHQ4) 



P48509) 



095931) 



Q9H9P2) 
P04632) 



043822) 



P30084) 



Q04743) 



O60573) 



Q9BS92) 
Q66578) 
P25788) 



Q9P2Z0) 



P16769) 



Annotation 
Aquaporin 5. 
B-cell receptor-associated protein 29 



Platelet-endothelial tetraspan antigen 3 



Chromobox protein homolog 7. 



Chondrolectin. 
Calpain small subunit 1 



Protein C21orf2 



Enoyl-CoA hydratase. 



Homeobox protein EMX2. 



Eukaryotic translation initiation factor 4E type 



NipSnap3B protein (SNAP1). 
Coat protein VP3. 
Proteasome subunit alpha type 3 



THAP domain protein 10. 



Uracil-DNA glycosylase 
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[VATP_HUMAN [(Q9Y5K8) (Vacuolar ATP synthase subunit D 



Marker 27700, d< 


own-regulated on IMAC30 " 


iGene symbol 


Accession imo. 


Annotation H 


143ZJ-IUMAN 




14-3-3 protein zeta/delta 


AQPA_HUMAN 




Aquaponn 10 


C1S__HUMAN 




Complement Cls component precursor 


CSS2_HUMAN 




Calpain small subunit 2 " 


FGFE_HUMAN 


(Q92915) 


Fibroblast growth factor-14 ' — j 


H XCo_HUMAIM 


(P31273) 


Homeobox protein Hox-C8 


NUCG_HUMAN 


(Q 14249) 


Endonuclease G. 


NXP2_HUMAN 


(095156) 


Neurexophilin 2. 


POLGJHE71B 


(Q66478) 


Coat protein VP2. 


SHHJHUMAN 


(Q15465) 


Sonic hedgehog protein C-product | 


SIX6__HUMAN 


(095475) 


Homeobox protein SIX6 


TMS2_HUMAN 


(015393) 


Transmembrane protease serine 2 non 


TRYA_HUMAN 


(P15157) 


Alpha-tryptase. 



|Marker 15580, down-regulated on IMAC30 

Annotation 



Gene symbol 



|CND8_HUMAN 



ECP__HUMAN 



IGJ_HUMAN 



!POLG_HRV2 
RET4JHUMAN 



SRB7_HUMAN 
VNS1_HRSVA 



Accession No, 



(Q9H867) 



(P12724) 



(P01591) 



(P04936) 
(P29373) 



(Q13503) 



(P04544) 



Protein C14orfl38. 




Core protein p2A. 
Retinoic acid-binding protein II, 



RNA polymerase II holoenzyme component SRB7 
Nonstructural protein 1 




Gene symbol 


Accession No. 


Annotation 


AOAFLHUMAN 


(P28039) 


Acyloxyacyl hydrolase small subunit. 


ASAH_HUMAN 


(Q13510) 


Acid ceramidase alpha subunit. 


C17_HUMAN 


(Q9NRR1) 


Cytokine-like protein C17. " 


[CU77_HUMAN 


(Q9NV44) 


Protein C21orf77. "1 


|nEF__HV1H2 


(P04601) 


Negative factor (F-protein) 




narKer 6680, dov 


vn-regulated on IMAC30 " ~~~ _ 


Gene symbol 


Accession No. 


Annotation 


CU51JHUMAN 


(P58511) 


Protein C21orf51. " 
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i ml Kcl DOOU; GO 1 


wn-regulated on IMAC30 " 


f^pnp cvmhnl 


Accession No. 


Annotation 


68MPJHUMAN 


(P56378) 


6.8 kDa mitochondrial proteolipid 


A4_HUMAN 


(P05067) 


Gamma-CTF(57). 


GALAJ-IUMAN 


(P22466) 


Galanin message-associated peptide. 


NRG4JHUMAN 


(Q8WWG1) 


Neuregulin-4. ~ ~ 


PE19_HUMAN 


(P48539) 


Brain-specific polypeptide PEP-19 


RS30_HUMAN 


(Q05472) 


40S ribosomal protein S30. 




marker 5430, dov 


vn-regulated on IMAC30 


Gene symbol 


Accession No. 


Annotation ~~ — . 


E306__ADE35 


(P17591) 


Early E3 6.4 kDa protein. 


GAGJHV1BR 


(P03348) 


Core protein p7. " ~ 


GAGJ-IV1H2 


(P04591) 


Core protein p7. 


GAG_HV1LW 


(Q70622) 


Core protein p7. 


MT4_HUMAN 


(P47944) 


Metallothionein-IV (MT-IV). 


YG02_HUMAN 


(060908) 


Hypothetical 6.4 kDa protein A-363E6.1. 



EXAMPLE 18 POSSIBLE IDENTIFICATION OF PLASMA MARKERS BY DIRECT MASS 
SEARCH. 

This identification of plasma markers was performed as described for the serum markers in 
Example 17. 



Table 24 Up-regulated plasma markers 



Marker 14100, up-regulated on IMAC30 

Annotation 



Short name 



BATF__HUMAN 



HEX9_ADE07 



IL9_HUMAN 



YYY3_HUMAN 



Code 




(Q16520) 



(P03283) 



(P15248) 



(P20931) 




Very very hypothetical B-cell growth factor 
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Marker 14030, up 


-regulated on IMAC30 ~~ " ~~ 


Short name 


i-oae 


Annotation 


CTRB__HUMAN 


(P17538) 


Chymotrypsin B chain B. 


GRL1_ HUMAN 


(Q9H0R8) 


Gamma-aminobutyric acid receptor-associated protein-like 


H2AA_HUMAN 


(P28001) 


Histone H2A.a 


H2AM__HUMAN 


(P04908) 


Histone H2A.m (H2A/m). 


PRB4__HUMAN 


(P10163) 


Salivary proline-rich protein PO precursor 


UL30_HCMVA 


(P16765) 


Hypothetical protein UL30. 




Marker 13870, up 


-regulated on IMAC30 


Short name 


Code 


Annotation 


CST8_HUMAN 


(060676) 


Cystatin 8 " " " 


CYTDJ-IUMAN 


(P28325) 


Cystatin D. 


H2BE_HUMAN 


(Q99879) 


Histone H2B.e (H2B/e). " 




Marker 13747, up 


-regulated on IMAC30 


Short name 


Code 


Annotation — - — - 


ASAH_HUMAN 


(Q13510) 


Acid ceramidase alpha subunit. 


CHM1_HUMAN 


(075829) 


Chondromodulin-L 


H2BJ__HUMAN 


(Q93079) 


Histone H2B j (H2B/j). 


H2BR_HUMAN 


(P06899) 


Histone H2B.r (H2B/r) (H2B.1). " 


RS25JHUMAN 


(P25111) 


40S ribosomal protein S25. 


TTHY_HUMAN 


(P02766) 


Transthyretin. 


V AG INHUMAN 


(075348) 


Vacuolar ATP synthase subunit G 1 




Marker 11723, up-regulated on IMAC30 


Short name 


Code 


Annotation 


ALK1_HUMAN 


(P03973) 


Antileukoproteinase 1. 


B2MG_HUMAN 


(P01884) 


Beta-2-microglobulin. 


GPB5_HUMAN 


(Q86YW7) 


Glycoprotein hormone beta 5. 


LSM3__HUMAN 


(Q9Y4Z1) 


U6 snRNA-associated Sm-like protein LSm3 


MIR2JHUMAN 


(Q9Y6H6) 


Potassium voltage-gated channel subfamily E member 3 


PRL5_HUMAN 


(Q99954) 


Proline-rich protein 5 


RE\ThV2RO 


(P04615) 


Anti-repression transactivator protein 


S103_HUMAN 


(P33764) 


S100 calcium-binding protein A3 


S1G4_HUMAN 


(P26447) 


Placental calcium-binding protein 


S111_HUMAN 


(P31949) 


Calgizzarin 


S209_HUMAN 


(Q07325) 


Small inducible cytokine B9 


ULA9_HCMVA 


(P16738) 


Hypothetical protein UL109. " " 
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Marker 9950, up-i 


egulated on IMAC30 


Short name 


Code 


Annotation 


CAR7LHUMAN 


(Q16568) 


Cocaine- and amphetamine-regulated transcript protein 


K123_HUMAN 


(P60328) 


Keratin associated protein KAP12- 


NUOS_HUMAN 


(Q9IMRX3) 


NADHrubiquinone oxidoreductase MLRQ subunit homolog 


VE4_HPV51 


(P26548) 


Probable E4 protein 




Marker 7469, up-r 


egulated on IMAC30 " 


Short name 


Code 


Annotation ~~ ~~ 


IGF2_HUMAN 


(P01344) 


Insulin-like growth factor IL 




Marker 5905, up-r 


egulated on IMAC30 


Short name 


Code 


Annotation " 


A4_HUMAN 


(P05067) 


Gamma-CTF(50) (By similarity). 




Marker 4977, up-r 


egulated on IMAC30 ~~ 


Short name 


Code 


Annotation 


GIPJ-iUMAN 


(P09681) 


Gastric inhibitory polypeptide. 




Marker 4136, up-r 


egulated on IMAC30 


Short name 


Code 


Annotation 


UCN3_HUMAN 


(Q969E3) 


Urocortin III. ~~ " 



Table 25 Down-regulated plasma markers 
Marker 66800, down-regulated on IMAC30 



Short name 


Code 


Annotation 


3BP1_HUMAN 


(Q9Y3L3) 


SH3-domain binding protein 1 


DCE1_HUMAN 


(Q99259) 


Glutamate decarboxylase 


IF3I_HUMAN 


(Q9Y262) 


Eukaryotic translation initiation factor 3 subunit 6 interacting 
protein 


LIB3_HUMAN 


(O75022) 


Leukocyte immunoglobulin-like receptor subfamily B member 3 
precursor 


MAG_HUMAN 


(P20916) 


MyeJin-associated glycoprotein precursor (Siglec-4a). 


ML1X_HUMAN 


(Q13585) 


Melatonin- related receptor (H9). 


NKX3_HUMAN 


(Q9HC58) 


Sodium/potassium/calaum exchanger 3. 


NRD1_HUMAN 


(P20393) 


Orphan nuclear receptor NR1D1 


P2CDJ-IUMAN 


(015297) 


Protein phosphatase 2C delta isoform 


PEX5_HUMAI\I 


(P50542) 


Peroxisomal targeting signal 1 receptor 
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PRLR_HUMAN 


(P16471) 


Pro Is pi" in ro/~P> nfTi r* nron ~" 

• i wiDuuii » cwcjjli-m yjt ccursor 


PYRGJHUMAN 


(P17812) 


CTP synthase 


R1AB_CVHSA 


(P59641) 


Helicase (By similarity). 


5133_HUMAN 


(Q8WWT9) 


Solute carrier family 13, 


SAH3_HUMAN 


(Q96HN2) 


Putative adenosylhomocysteinase 3 


VU47_HHV6G 


(P30005) 


Glycoprotein U47 




Marker ddouu, qo 


wn-regulated on IMAC30 


snort name 


Code 


Annotation 


ArAM_n U MAN 


(P43652) 


Afamin. 


Al nit i ii ih/i a m 

ALBU_HUMAN 


(P02768) 


Serum albumin. 


AN21_HUMAN 


(Q86YR6) 


Ankyrin repeat domain protein 21 


BRL1_EBV 


(P03209) 


Transcription activator BRLF1. 


CALI_HUMAN 


(Q13939) 


Calicin. ~™ ' ' 


CD93_HUMAN 


(Q9NPY3) 


Complement component Clq receptor. 


CDYL_HUMAN 


(Q9Y232) 


Chromodomain Y-like protein (CDY-like). 


FETA__HUMAN 


(P02771) 


Alpha-fetoprotein. 


FPGTJHUMAN 


(014772) 


Fucose-l-phosphate guanyly transferase 


FUT8__HUMAN 


(Q9BYC5) 


Alpha-(l / 6)-fucosyltransferase 


GBP5_HUMAN 


(Q96PP8) 


Interferon-induced guanylate-binding protein 


GDS INHUMAN 


(P52306) 


Rapl GTPase-GDP dissociation stimulator 1 


GRK4JHUMAN 


(P32298) 


G protein-coupled receptor kinase 


MM09_HUMAN 


(P14780) 


type IV collagenase. — 


MOT8JHUMAN 


(P36021) 


Monocarboxylate transporter 8 


NR42__HUMAN 


(P43354) 


Orphan nuclear receptor NURRl 


SNX9_HUMAN 


(Q9Y5X1 


Sorting nexin 9) 


STB2_HUMAN 


(Q15833) 


Syntaxin binding protein 2 


VP40_HHV11 


(P10210) 


Gene UL26 protein. 


VU47JHHV6U 


(Q06093) 


Glycoprotein U47 precursor. 




Marker 66300, dov 


vn-regulated on IMAC30 " ~~~ ' " 


Short name 


Code 


Annotation 


2AAB_HUMAN 


(P30154) 


Serine/threonine protein phosphatase 2A 


ACDV__H U MAN 


(P49748) 


Acyl-CoA dehydrogenase 


AD30_HUMAN 


(Q9UKF2) 


ADAM 30. 


AN21_HUMAN 


(Q86YR6) 


Ankyrin repeat domain protein 


BS69_HUMAN 


(Q15326) 


Adenovirus 5 ElA-binding protein 


CDYL_HUMAN 


(Q9Y232) 


Chromodomain Y-like protein 


ESR1JHUMAN 


(P03372) 


Estrogen receptor 
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EXON_HHV2 
|GDS1_HUMAN 
LAM1_HUMAN 
LCP1_HUMAN 



MOT8_HUMAN 



|N056_HUMAN 
!PP02_HUMAN 



(P06489) 
(P52306) 
(P20700) 
(094842) 
(P36021) 




R1AB_CVH22 



RIB1_HUMAN 



TRI4JHUMAN 
WDR1_HUMAN 



YHL1 EBV 



(000567) 
(Q9UGN5) 
(Q05002) 



(P04843) 



(Q15650) 
(075083) 



(P03181) 



Z430_HUMAN (Q9H8G1) 



Alkaline exonuclease 

Rapl GTPa se-GDP dissociation stimulator 1 
Lamin Bl. 



Nucleolar protein Nop56 
Poly [ADP-ribose] polymerase-2 
Helicase. 




Thyroid recep tor interacting protein 4 
WD-repeat protein 1 




Zinc finger protein 430 




COE3_HUMAN 



DAZ4_HUMAN 



DOPO_HUMAN 



FL01_HUMAN 



GLSLJHUMAN 
HAS1_HUMAN 



HEX3_ADE12 



K2H4_HUMAN 



(Q9H4W6) 



(Q86SG3) 



(P09172) 
(P41440) 



(Q9UI32) 



Transcription factor COE3 




Dopamine beta-monooxygenase. 
Folate transporter 1 



(Q92839) 



KHL3_HUMAN 



KLC1_HUMAN 
LIGAJ-IUMAN 



MGD2__HUMAN 



MPI2_HUMAN 




(Q9UH77) 



(Q07866) 
(P41214) 



(Q9UNF1) 



(P30305) 



NOXl_HUMAN 



(Q9Y5S8) 



Glutaminase, liver isoform. 



Hyaluronan synthase 1 



Kelch-like protein 3. 



Kinesin light chain 1 (KLC 1), 
Ligatin 



Melanoma-associated antigen D2 




IMADPH oxidase homoiog 1 
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5fcN3_HUMAN 


(Q9H4L4) 


SH02__HUMAN 


(Q9UQ13) 


SOAl_HUMAN 


(P35610) 


SVC1_HUMAN 


(Q9UHI7) 


T9S3_HUMAN 


(Q9HD45) 


TAI2_HUMAN 


(Q8WYN3) 


TIP_HUMAN 


(Q8TB96) 



Sentrin-specific protease 3 



Leucine-rich repeat protein SHOC-2 



Sterol O-acyltransferase 1 
Solute carrie r family 23, member 1 
Transmembrane 9 superfamily protein member 3 



TGF-beta induced apoptosis protein 2 
T-cell immunomodulatory protein 



Marker 8931, down-regulated on IMAC30 



[Short name 



APC2__HUMAN 



IIL8JHUMAN 



PLMN_HUMAN 



SLUR_HUMAN 



I SRG INHUMAN 



Code 




(P02655) 



(P10145) 



(P00747) 



(P55000) 



Annotation 
Apolipoprotein C-II. 



Interleukin-8. 



Plasminogen precursor, Activation peptide. 



Secreted Ly-6/uPAR related protein 1. 




Marker 6635, dov\ 


m-regulated 


on IMAC30 ~~ 


IShort name 


Code 


Annotation " — ~ " — ' 


APC1__HUMAN 


(P02654) 


Apolipoprotein C-I. 


CCKfNLHUMAN 


(P06307) 


Cholecystokinin CCK58. 


C07R_HUMAI\I 


(0 14548) 


Cytochrome c oxidase subunit Vila-related protein 


PART_HUMAN 


(Q9NPD0) 


Prostate-specific and androgen regulated protein 


RS30_HUMAN 


(Q05472) 


40S ribosomal protein S30 " ~ 




narKer b435, dow 


n-regulated on IMAC30 ~ 


Short name 


Code 


Annotation — 


E306_ADE35 


(P17591) 


Early E3 6.4 kDa protein. 


GAG__HV1B1 


(P03347) 


Core protein p7. 


GAG_HV1BR 


(P03348) 


Core protein p7. 


GAG_HV1H2 


(P04591) 


Core protein p7. H 


GAGJ-IV1LW 


(Q70622) 


Core protein p7. n 


GAG_HV1PV 


(P03350) 


Core protein p7. ~~ — 


YG02JHUMAN 


(060908) 


Hypothetical 6.4 kDa protein A-363E6.1. " 
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EXAMPLE 19 POSSIBLE IDENTIFICATION OF PLASMA AND SERUM MARKERS BY 
ARTIFICIAL DIGESTION 



In some cases the measured markers correspond to the theoretical mass of a protein in 
5 the database (for example the Swiss-Prot database for human proteins) in other cases no 
significant hit can be obtained (there is no protein with a theoretical mass within for 
example 0.2% of the identified mass of the marker). There could be a number of reasons 
for this: the database is not complete (databases are continually being updated), the 
identified mass is a protein with post-translational modifications (these modifications add 

10 to the final mass, and are never accounted for in the database), the identified mass is not 
a mass of a full length protein, but a fragment of a protein (there is an almost Infinite 
number of fragments for every protein and these are not accounted for in the database). If 
the identified mass corresponds to a fragment of a marker, a possible identification can be 
obtained by so called "artificial digest" or "in silico digest" of a protein of interest. In this 

15 procedure the sequence of a protein is pasted into a digestion program. This program then 
cleaves the sequence into specific fragments and calculates the mass values of these 
fragments. Some of these mass values may correspond to the measured mass values of 
the markers. This fragment may be an identification of the marker. However, there are 
more than one hundred thousand protein sequences in the database, which in theory each 

20 produces an infinite amount of fragments. Our screening was done on blood samples 
(serum or plasma), therefore we focused solely on a few common blood proteins. 

Methods 

The protein sequence was obtained from the NCBI Entrez Protein Bank In fasta format. 
25 The sequence was digested by "PeptideMass" on the ExPASy server. 
The following parameters were chosen: 
Mass value: [M], average. 

Enzyme: Trypsin (higher specificity) 

Allowed missed cleavage sites: 5 



30 



We have chosen Trypsin (higher specificity) based on the assumption that most proteases 
in blood are members of the trypsin-familiy of proteases. 

The program allows for a maximum of 5 missed cleavage sites. This means that fragments 
35 of proteins that contain more than 5 cleavage sites will not be presented. Fragments 
containing more than 5 cleavage sites are however possible. 
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Results: 

In some cases the measured markers correspond to the theoretical mass of a protein in 
the database (for example the Swiss-Prot database for human proteins). We have 
5 artificially digested the following common blood proteins: 

Human Serum Albumin (P02768), Haptoglobin (P00738), Alpha 2 Macroglobulin (P01023), 
C2 Complement (P06681), C3 complement (P01024) 

In some cases the measured markers correspond to the theoretical mass of a protein in 
the database (for example the Swiss-Prot database for human proteins). 



Table 26 Possible hits of up and down-regulated plasma markers 
Human Serum Albumin I I ~~ 



Possible hits 


; of up-regulated markers: 5920, 5900, 5330, 4460 


mass 


position 


#MC 


peptide sequence 


59307162 


21-73 


3 


ALVUAFAQYLQQCPFEDHV KLVN EVTEFA KTC VA D E S A E 
NCDKSLHTLFGDK (SEQ ID NO 12) 


59046970 


18629 


5 


DAHKSEVAHRFKDLGEENFK ALVUAFAQYLQQCPFEDHV KLVNEVTEFAK 
(SEQ ID NO 13) 


53309633 


476-521 


3 


CCTESLVNRRPCFSALEVDE TYVPKEFNAETFTFHADICT LSEKER( SEQ ID 
NO 14) 


44591434 


501-538 


5 


EFNAETFTFHADICTLSEKE RQIKKQTALVELVKHKPK (SEQ ID NO 15) 




Haptoglobin ■ — 


Possible hits of up-regulated markers: 9140, 5330, 2955 


mass 


[position 


#MC 


peptide sequence ~ " 


91321617 


298-379 


5 


YVMLPVADQDQCIRHYEGST VPEKKTPKSPVGVQPILNEH 
TFCAGMSKYQEDTCYGDAGS AFAVHDLEEDTWYATGILSF DK(SEQ ID 
NO 16) 


53311397 


298-345 


4 


YVMLPVADQDQCIRHYEGST VPEKKTPKSPVGVQPILNEH TFCAGMSK 
(SEQ ID NO 1) 


29605032 


252-277 


4 


LKQKVSVNERVMPICLPSKD YAEVGR (SEQ ID NO 18) 




Alpha 2 macroglobulin ™ " — " — 


possible hits 


of up-regulated markers: 10830, 8930, 5900, 5330, 2960 


108473269 


935-1031 


5 


LPPNVVEESARASVSVLGDI LGSAMQNTQNLLQMPYGCGE 
QNMVLFAPNIYVLDYLNETQ QLTPEVKSKAIGYLNTGYQR 
QLNYKHYDGSYSTFGER (SEQ ID NO 19) 


89304246 


429-507 


4 


SPCYGYQWVSEEHEEAHHTA YLVFSPSKSFVHLEPMSHEL 
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PCGHTQTVQAHYILNGGTLL GLKKI SFYY1 TMAkT^TWD rccn tr\ Mr\ ->n\ 


59060717 


94-145 


5 


SSS N E EVM FLTVQVKG PTQE FKKRTTVMVKNEDSLVFVQT 
DKSIYKPGQTVK (SEQ ID NO 21) 


53335611 


688-733 


4 


MCPQLQQYEMHGPEGLRVGF YESDVMGRGHARLVHVEEPH TETVRK 
(SEQ ID NO 22) 


29612771 


1449-1474 


1 


VYDYYETDEFAIAEYNAPCS KDLGNA (SEQ ID NO 23) ~~ 


• 


Possible hits 


of down-regulated markers: 4660, 4290, 


46600966 


272-312 


3 


YSDASDCHGEDSQAFCEKFS GQLNSHGCFYQQVKTKVFQL K (SEQ ID 
NO 24) 


42903594 


1082-1122 


1 


SSGSLLNNAIKGGVEDEVTL SAYITIALLEIPLTVTHPVV R (SEQ ID NO 
25) 




C2 complement 


Possible hits 


of up-regulated markers: 8930, 2960 . " — 


89241288 


78-162 


5 


SLSKAVCKPVRCPAPVSFEN GIYTPRLGSYPVGGNVSFEC 

EDGFILRGSPVRQCRPNGMW Dbbl AVCDNGAGHCPNPGIS LGAVR (SEQ 
ID NO 26) 


29595139 


717-740 


4 


APRSKVPPPRDFHINLFRMQ PWLR (SEQ ID NO 27) 




Possibel hits 


of down-regulated markers: 6660, 4290 


66636239 


63-124 


b 


LCKSSGQWQTPGATRSLSKA VCKPVRCPAPVSFENGIYTP 
RLGSYPVGGNVSFECEDGFI LR (SEQ ID NO 28) 


42867414 


167-205 


4 


FGHGDKVRYRCSSNLVLTGS SERECQGNGVWSGTEPICR (SEQ ID NO " 
29) 




Complement C3: ~~ " 


Possible hits 


of up-regulated markers: 9140, 6090, 5900, 5540, 5330, 4460, 2960 


91396841 


1073-1155 


5 


APSTWLTAYWKVFSLAVNL IAIDSQVLCGAVKWULEKQ — 

KPDGVFQEDAPVIHQEMIGG LRNNNEKDMALTAFVLISLQ EAK (SEQ ID 
NO 30) 


60898032 


208-258 


3 


AYYENSPQQVFSTEFEVKEY VLPSFEVIVEPTEKFYYIYN EKGLEVTITAR 
(SEQ ID NO 31) 


59014994 


623-678 


4 


ADIGCTPGSGKDYAGVFSDA GLTFTSSSGQQTAQRAELQC 
PQPAARRRRSVQLTEK (SEQ ID NO 32) 


55414229 


156-205 


2 


LLPVGRTVMVNIENPEGIPV KQDSLSSQNQLGVLPLSWDI PELVNMGQWK 
(SEQ ID NO 33) 


53329345 


1304-1351 


5 


SSKITHRIHWESASLLRSEE TKENEGFTVTAEGKGQGTLS WTMYHAK 
(SEQ ID NO 34) 


44682808 


137-176 


3 


TIYTPGSTVLYRIFTVNHKL LPVGRTVMVNIENPEGIPVK (SEQ ID NO 35) 
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44539780 


1392-1431 


3 


YRGDQDATMSILDISMMTGF APDTDDLKQLANGVDRYISK (SEQ ID NO 
36) 


29583902 


1498-1522 


4 


EDGKLNKLCRDELCRCAEEN CFIQK (SEQ ID NO 37) 


29572528 


1285-1310 


2 


DAPDHQELNLDVSLQLPSRS SKITHR (SEQ ID NO 38) 




Possibel hits of down-regulated markers: 6880, 4660, 4290 


68838812 


980-1041 


3 


ILLQGTPVAQMTEDAVDAER LKHLIVTPSGCGEQNMIGMT 
PTVIAVHYLDb 1 bQWEKFGL EK (SEQ ID NO 39) 


46624092 


1204-1244 


5 


GPLLNKFLTTAKDKNRWEDP GKQLYNVEATSYALLALLQL K (SEQ ID NO 
40) 


46593269 


1002-1042 


2 


HUVTPSGCGEQNMIGMTPT VIAVHYLDETEQWEKFGLEK R (SEQ ID NO 
41) 


42967974 


206-241 


2 


IRAYYENSPQQVFSTEFEVK EYVLPSFEVIVEPTEK (SEQ ID NO 42) 
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